SIGNAL TRANSDUCTION GENES AND METHODS OF USE



TECHNICAL FIELD 

The present invention relates generally to plant molecular biology. More 
specifically, it relates to nucleic acids and methods for modulating their expression in
plants. 



BACKGROUND OF THE INVENTION 

Cells are constantly bombarded by external signals that regulate their growth,
differentiation, and stress level. To respond properly to these signals, eukaryotic cells
assemble cascades of highly conserved protein kinases (mitogen-activated protein kinases,
MAPKs, and their activator kinases), which form the central elements of signal
transduction pathways that lead to and activate transcription factors in the nucleus and
other effectors throughout the cell. (Elion, E. A. Routing MAP Kinase Cascades.
Science, Vol. 281 (11 September 1998), pp. 1625-1626.)

Plants' sessile nature requires strategies to adapt to changes in their local
environment. Responses to signals such as pathogen attack, herbivore feeding, or changes
in light, temperature, or nutrient availability often involve phytohormones such as
ethylene. MAP kinase modules appear to be involved in ethylene signaling and auxin-induced
cell proliferation. Upstream and downstream regulatory components suggest that
MAP kinases may serve as a central point in a signal transduction network where different
upstream pathways converge and may distribute the signals to different downstream
targets. (Jonak, C., et al. MAP kinases: universal multi-purpose signaling tools. Plant
Molecular Biology, Vol. 24 (1994), pp. 407-416.)


DESCRIPTION OF THE INVENTION 
Overview 

Generally, it is the object of the present invention to provide nucleic acids and 
proteins relating to signal transduction genes. It is an object of the present invention to 
provide transgenic plants comprising the nucleic acids of the present invention, and
methods for modulating, in a transgenic plant, expression of the nucleic acids of the 
present invention. 

Therefore, in one aspect the present invention relates to an isolated nucleic acid 
comprising a member selected from the group consisting of (a) a polynucleotide having a
specified sequence identity to a polynucleotide encoding a polypeptide of the present
invention; (b) a polynucleotide which is complementary to the polynucleotide of (a); and, 
(c) a polynucleotide comprising a specified number of contiguous nucleotides from a 
polynucleotide of (a) or (b). The isolated nucleic acid can be DNA. 

In other aspects the present invention relates to: 1) recombinant expression
cassettes, comprising a nucleic acid of the present invention operably linked to a promoter, 
2) a host cell into which has been introduced the recombinant expression cassette, and 3) a 
transgenic plant comprising the recombinant expression cassette. The host cell and plant 
are optionally from maize, wheat, rice, or soybean. 

Unless otherwise stated, the polynucleotide and polypeptide sequences identified in
the accompanying Sequence Listing represent polynucleotides and polypeptides of the 
present invention. A nucleic acid of the present invention comprises a polynucleotide of 
the present invention. A protein of the present invention comprises a polypeptide of the 
present invention. 

The present invention provides utility in such exemplary applications as regulating
expression of phytohormones, including ethylene, auxins, cytokinins, and gibberellin, in 
order to effect developmental changes in plants and provide control of plant response to 
environmental stresses. Various preferred embodiments are disclosed throughout the 
specification. 


Definitions 

Units, prefixes, and symbols may be denoted in their SI accepted form. Unless
otherwise indicated, nucleic acids are written left to right in 5' to 3' orientation; amino acid
sequences are written left to right in amino to carboxy orientation, respectively. Numeric
ranges recited within the specification are inclusive of the numbers defining the range and
include each integer within the defined range. Amino acids may be referred to herein by
either their commonly known three-letter symbols or by the one-letter symbols
recommended by the IUPAC-IUBMB Nomenclature Commission. Nucleotides, likewise,
may be referred to by their commonly accepted single-letter codes. Unless otherwise
provided for, software, electrical, and electronics terms as used herein are as defined in
The New IEEE Standard Dictionary of Electrical and Electronics Terms (5th edition, 1993).
The terms defined below are more fully defined by reference to the specification as a 
whole. Section headings provided throughout the specification are not limitations to the 
various objects and embodiments of the present invention. 

By "amplified" is meant the construction of multiple copies of a nucleic acid
sequence or multiple copies complementary to the nucleic acid sequence using at least one
of the nucleic acid sequences as a template. Amplification systems include the polymerase
chain reaction (PCR) system, ligase chain reaction (LCR) system, nucleic acid sequence
based amplification (NASBA, Cangene, Mississauga, Ontario), Q-Beta Replicase systems,
transcription-based amplification system (TAS), and strand displacement amplification
(SDA). See, e.g., Diagnostic Molecular Microbiology: Principles and Applications, D. H.
Persing et al., Ed., American Society for Microbiology, Washington, D.C. (1993). The
product of amplification is termed an amplicon. 

As used herein, "antisense orientation" includes reference to a duplex 
polynucleotide sequence that is operably linked to a promoter in an orientation where the 
antisense strand is transcribed. The antisense strand is sufficiently complementary to an 
endogenous transcription product such that translation of the endogenous transcription 
product is often inhibited. 

By "encoding" or "encoded", with respect to a specified nucleic acid, is meant 
comprising the information for translation into the specified protein. A nucleic acid 
encoding a protein may comprise non-translated sequences (e.g., introns) within translated 
regions of the nucleic acid, or may lack such intervening non-translated sequences (e.g., as 
in cDNA). The information by which a protein is encoded is specified by the use of 
codons. Typically, the amino acid sequence is encoded by the nucleic acid using the 
"universal" genetic code. However, variants of the universal code, such as are present in 
some plant, animal, and fungal mitochondria, the bacterium Mycoplasma capricolum, or
the ciliate Macronucleus, may be used when the nucleic acid is expressed therein. 

When the nucleic acid is prepared or altered synthetically, advantage can be taken 
of known codon preferences of the intended host where the nucleic acid is to be expressed.
For example, although nucleic acid sequences of the present invention may be expressed in 
both monocotyledonous and dicotyledonous plant species, sequences can be modified to 
account for the specific codon preferences and GC content preferences of monocotyledons 
or dicotyledons as these preferences have been shown to differ (Murray et al., Nucl. Acids
Res. 17: 477-498 (1989)). Thus, the maize preferred codon for a particular amino acid
may be derived from known gene sequences from maize. Maize codon usage for 28 genes
from maize plants is listed in Table 4 of Murray et al., supra.
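
By way of illustration only, deriving a preferred codon for each amino acid from a set of known coding sequences can be sketched as follows. This is a minimal sketch under stated assumptions: the function name preferred_codons and the toy sequences are hypothetical and are not taken from the specification or from Murray et al.; only the standard ("universal") genetic code is used.

```python
from collections import Counter, defaultdict

# Standard ("universal") genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def preferred_codons(coding_sequences):
    """Return {amino acid: most frequent codon} for in-frame DNA coding sequences.

    Hypothetical helper for illustration; stop codons and codons containing
    ambiguous bases are ignored.
    """
    counts = defaultdict(Counter)
    for cds in coding_sequences:
        cds = cds.upper()
        # Walk the sequence codon by codon, skipping any trailing partial codon.
        for pos in range(0, len(cds) - len(cds) % 3, 3):
            codon = cds[pos:pos + 3]
            amino_acid = CODON_TABLE.get(codon)
            if amino_acid is not None and amino_acid != "*":
                counts[amino_acid][codon] += 1
    return {aa: codon_counts.most_common(1)[0][0]
            for aa, codon_counts in counts.items()}


# Toy sequences invented for this example (not real maize genes).
toy_genes = ["ATGGCCGCCGCTTAA", "ATGGCCCTGCTGCTCTGA"]
print(preferred_codons(toy_genes))
```

In practice the input would be a collection of known coding sequences from the intended host, and ties between equally frequent codons would need an explicit rule.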

As used herein "full-length sequence" in reference to a specified polynucleotide or
its encoded protein means having the entire amino acid sequence of a native (non-synthetic),
endogenous, biologically (e.g., structurally or catalytically) active form of the
specified protein. Methods to determine whether a sequence is full-length are well known
in the art including such exemplary techniques as northern or western blots, primer
extension, S1 protection, and ribonuclease protection. See, e.g., Plant Molecular Biology:
A Laboratory Manual, Clark, Ed., Springer-Verlag, Berlin (1997). Comparison to known
full-length homologous (orthologous and/or paralogous) sequences can also be used to
identify full-length sequences of the present invention. Additionally, consensus sequences
typically present at the 5' and 3' untranslated regions of mRNA aid in the identification of
a polynucleotide as full-length. For example, the consensus sequence ANNNNAUGG,
where the AUG codon represents the N-terminal methionine, aids in determining
whether the polynucleotide has a complete 5' end. Consensus sequences at the 3' end, 
such as polyadenylation sequences, aid in determining whether the polynucleotide has a 
complete 3' end. 
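
As an illustrative sketch only, a search for the ANNNNAUGG consensus recited above can be written with a regular expression. The function name find_start_consensus and the example sequence are hypothetical; a match is merely supporting evidence of a complete 5' end and does not by itself establish that a sequence is full-length.

```python
import re

# ANNNNAUGG: an A, any four nucleotides, the AUG start codon, then a G.
# The lookahead lets overlapping occurrences be reported as well.
CONSENSUS = re.compile(r"(?=(A[ACGU]{4}AUGG))")


def find_start_consensus(mrna):
    """Return 0-based positions of the AUG inside each ANNNNAUGG occurrence."""
    rna = mrna.upper().replace("T", "U")  # accept cDNA-style (T) input too
    return [m.start() + 5 for m in CONSENSUS.finditer(rna)]


# Sequence invented for this example.
example = "GGACCGCAUGGCUUAA"
print(find_start_consensus(example))
```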

As used herein, "heterologous" in reference to a nucleic acid is a nucleic acid that
originates from a foreign species, or, if from the same species, is substantially modified
from its native form in composition and/or genomic locus by human intervention. For
example, a promoter operably linked to a heterologous structural gene is from a species
different from that from which the structural gene was derived, or, if from the same
species, one or both are substantially modified from their original form. A heterologous
protein may originate from a foreign species or, if from the same species, is substantially
modified from its original form by human intervention. 

By "host cell" is meant a cell which contains a vector and supports the replication
and/or expression of the vector. Host cells may be prokaryotic cells such as E. coli, or
eukaryotic cells such as yeast, insect, amphibian, or mammalian cells. Preferably, host
cells are monocotyledonous or dicotyledonous plant cells. A particularly preferred
monocotyledonous host cell is a maize host cell. 

The term "introduced" includes reference to the incorporation of a nucleic acid into 
a eukaryotic or prokaryotic cell where the nucleic acid may be incorporated into the 
genome of the cell (e.g., chromosome, plasmid, plastid or mitochondrial DNA), converted
into an autonomous replicon, or transiently expressed (e.g., transfected mRNA). The term
includes such nucleic acid introduction means as "transfection", "transformation" and
"transduction".

The term "isolated" refers to material, such as a nucleic acid or a protein, which is:
(1) substantially or essentially free from components which normally accompany or
interact with it as found in its natural environment. The isolated material optionally
comprises material not found with the material in its natural environment; or (2) if the
material is in its natural environment, the material has been synthetically altered or
synthetically produced by deliberate human intervention and/or placed at a different
location within the cell. The synthetic alteration or creation of the material can be
performed on the material within or apart from its natural state. For example, a naturally-occurring
nucleic acid becomes an isolated nucleic acid if it is altered or produced by non-natural,
synthetic methods, or if it is transcribed from DNA which has been altered or
produced by non-natural, synthetic methods. The isolated nucleic acid may also be
produced by the synthetic re-arrangement ("shuffling") of a part or parts of one or more
allelic forms of the gene of interest. Likewise, a naturally-occurring nucleic acid (e.g., a
promoter) becomes isolated if it is introduced to a different locus of the genome. Nucleic
acids which are "isolated," as defined herein, are also referred to as "heterologous" nucleic
acids. See, e.g., Compounds and Methods for Site Directed Mutagenesis in Eukaryotic
Cells, Kmiec, U.S. Patent No. 5,565,350; In Vivo Homologous Sequence Targeting in
Eukaryotic Cells, Zarling et al., WO 93/22443; Methods for In Vitro Recombination,
Stemmer, U.S. Patent No. 5,605,793.

As used herein, "nucleic acid" includes reference to a deoxyribonucleotide or
ribonucleotide polymer, or chimeras thereof, in either single- or double-stranded form, and
unless otherwise limited, encompasses known analogues having the essential nature of
natural nucleotides in that they hybridize to single-stranded nucleic acids in a manner
similar to naturally occurring nucleotides (e.g., peptide nucleic acids).

Unless otherwise stated, a "signal transduction nucleic acid" is a nucleic acid of the
present invention and means a nucleic acid comprising a polynucleotide of the present
invention (a "signal transduction polynucleotide") encoding a signal transduction
polypeptide. A "signal transduction gene" is a gene of the present invention and refers to a
full-length signal transduction polynucleotide. 

By "nucleic acid library" is meant a collection of isolated DNA or RNA molecules
which comprise and substantially represent the entire transcribed fraction of a genome of a
specified organism, tissue, or of a cell type from that organism. Construction of exemplary
nucleic acid libraries, such as genomic and cDNA libraries, is taught in standard molecular
biology references such as Berger and Kimmel, Guide to Molecular Cloning Techniques,
Methods in Enzymology, Vol. 152, Academic Press, Inc., San Diego, CA (Berger);
Sambrook et al., Molecular Cloning - A Laboratory Manual, 2nd ed., Vol. 1-3 (1989); and
Current Protocols in Molecular Biology, F. M. Ausubel et al., Eds., Current Protocols, a
joint venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc. 
(1994). 

As used herein "operably linked" includes reference to a functional linkage
between a promoter and a second sequence, wherein the promoter sequence initiates and
mediates transcription of the DNA sequence corresponding to the second sequence. 
Generally, operably linked means that the nucleic acid sequences being linked are 
contiguous and, where necessary to join two protein coding regions, contiguous and in the 
same reading frame. 

As used herein, the term "plant" includes reference to whole plants, plant parts or
organs (e.g., leaves, stems, roots, etc.), plant cells, seeds and progeny of same. Plant cell,
as used herein, further includes, without limitation, cells obtained from or found in: seeds,
suspension cultures, embryos, meristematic regions, callus tissue, leaves, roots, shoots,
gametophytes, sporophytes, pollen, and microspores. Plant cells can also be understood to
include modified cells, such as protoplasts, obtained from the aforementioned tissues. The
class of plants which can be used in the methods of the invention is generally as broad as 
the class of higher plants amenable to transformation techniques, including both 
monocotyledonous and dicotyledonous plants. A particularly preferred plant is Zea mays. 

As used herein, "polynucleotide" includes reference to a deoxyribopolynucleotide,
ribopolynucleotide, or chimeras or analogs thereof that have the essential nature of a
natural deoxy- or ribo- nucleotide in that they hybridize, under stringent hybridization
conditions, to substantially the same nucleotide sequence as naturally occurring
nucleotides and/or allow translation into the same amino acid(s) as the naturally occurring
nucleotide(s). A polynucleotide can be full-length or a subsequence of a native or
heterologous structural or regulatory gene. Unless otherwise indicated, the term includes
reference to the specified sequence as well as the complementary sequence thereof. Thus,
DNAs or RNAs with backbones modified for stability or for other reasons are
"polynucleotides" as that term is intended herein. Moreover, DNAs or RNAs comprising
unusual bases, such as inosine, or modified bases, such as tritylated bases, to name just two
examples, are polynucleotides as the term is used herein. It will be appreciated that a great
variety of modifications have been made to DNA and RNA that serve many useful purposes
known to those of skill in the art. The term polynucleotide as it is employed herein embraces
such chemically, enzymatically or metabolically modified forms of polynucleotides, as well
as the chemical forms of DNA and RNA characteristic of viruses and cells, including among
other things, simple and complex cells. 

The terms "polypeptide", "peptide" and "protein" are used interchangeably herein
to refer to a polymer of amino acid residues. The terms apply to amino acid polymers in
which one or more amino acid residue is an artificial chemical analogue of a
corresponding naturally occurring amino acid, as well as to naturally occurring amino acid
polymers. The essential nature of such analogues of naturally occurring amino acids is
that, when incorporated into a protein, that protein is specifically reactive to antibodies
elicited to the same protein but consisting entirely of naturally occurring amino acids. The
terms "polypeptide", "peptide" and "protein" are also inclusive of modifications including,
but not limited to, glycosylation, lipid attachment, sulfation, gamma-carboxylation of 
glutamic acid residues, hydroxylation and ADP-ribosylation. Further, this invention 
contemplates the use of both the methionine-containing and the methionine-less amino 
terminal variants of the protein of the invention. 

As used herein "promoter" includes reference to a region of DNA upstream from
the start of transcription and involved in recognition and binding of RNA polymerase and
other proteins to initiate transcription. A "plant promoter" is a promoter capable of
initiating transcription in plant cells whether or not its origin is a plant cell. Exemplary
plant promoters include, but are not limited to, those that are obtained from plants, plant
viruses, and bacteria which comprise genes expressed in plant cells such as Agrobacterium
or Rhizobium. Examples of promoters under developmental control include promoters that
preferentially initiate transcription in certain tissues, such as leaves, roots, or seeds. Such
promoters are referred to as "tissue preferred". Promoters which initiate transcription only
in certain tissue are referred to as "tissue specific". A "cell type" specific promoter
primarily drives expression in certain cell types in one or more organs, for example,
vascular cells in roots or leaves. An "inducible" or "repressible" promoter is a promoter
which is under environmental control. Examples of environmental conditions that may
effect transcription by inducible promoters include anaerobic conditions or the presence of
light. Tissue specific, tissue preferred, cell type specific, and inducible promoters
constitute the class of "non-constitutive" promoters. A "constitutive" promoter is a
promoter which is active under most environmental conditions.

As used herein "recombinant" includes reference to a cell or vector that has been
modified by the introduction of a heterologous nucleic acid or that the cell is derived from
a cell so modified. Thus, for example, recombinant cells express genes that are not found
in identical form within the native (non-recombinant) form of the cell or express native
genes that are otherwise abnormally expressed, under-expressed or not expressed at all as a
result of human intervention. The term "recombinant" as used herein does not encompass
the alteration of the cell or vector by naturally occurring events (e.g., spontaneous
mutation, natural transformation/transduction/transposition) such as those occurring
without human intervention.

As used herein, a "recombinant expression cassette" is a nucleic acid construct, 
generated recombinantly or synthetically, with a series of specified nucleic acid elements 
which permit transcription of a particular nucleic acid in a host cell. The recombinant
expression cassette can be incorporated into a plasmid, chromosome, mitochondrial DNA,
plastid DNA, virus, or nucleic acid fragment. Typically, the recombinant expression 
cassette portion of an expression vector includes, among other sequences, a nucleic acid to 
be transcribed, and a promoter. 

The terms "residue", "amino acid residue" and "amino acid" are used
interchangeably herein to refer to an amino acid that is incorporated into a protein,
polypeptide, or peptide (collectively "protein"). The amino acid may be a naturally 
occurring amino acid and, unless otherwise limited, may encompass non-natural analogs of 
natural amino acids that can function in a similar manner as naturally occurring amino 
acids. 

The term "selectively hybridizes" includes reference to hybridization, under
stringent hybridization conditions, of a nucleic acid sequence to a specified nucleic acid
target sequence to a detectably greater degree (e.g., at least 2-fold over background) than
its hybridization to non-target nucleic acid sequences and to the substantial exclusion of
non-target nucleic acids. Selectively hybridizing sequences typically have about at least
80% sequence identity, preferably 90% sequence identity, and most preferably 100%
sequence identity (i.e., complementary) with each other.

The term "stringent conditions" or "stringent hybridization conditions" includes 
reference to conditions under which a probe will selectively hybridize to its target 
sequence, to a detectably greater degree than to other sequences (e.g., at least 2-fold over
background). Stringent conditions are sequence-dependent and will be different in
different circumstances. By controlling the stringency of the hybridization and/or washing
conditions, target sequences can be identified which are 100% complementary to the probe 
(homologous probing). Alternatively, stringency conditions can be adjusted to allow some 
mismatching in sequences so that lower degrees of similarity are detected (heterologous 
probing). Generally, a probe is less than about 1000 nucleotides in length, optionally less
than 500 nucleotides in length.

Typically, stringent conditions will be those in which the salt concentration is less
than about 1.5 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other
salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g., 10
to 50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50
nucleotides). Stringent conditions may also be achieved with the addition of destabilizing
agents such as formamide. Exemplary low stringency conditions include hybridization
with a buffer solution of 30 to 35% formamide, 1 M NaCl, 1% SDS (sodium dodecyl
sulphate) at 37°C, and a wash in 1X to 2X SSC (20X SSC = 3.0 M NaCl/0.3 M trisodium
citrate) at 50 to 55°C. Exemplary moderate stringency conditions include hybridization in
40 to 45% formamide, 1 M NaCl, 1% SDS at 37°C, and a wash in 0.5X to 1X SSC at 55 to
60°C. Exemplary high stringency conditions include hybridization in 50% formamide, 1
M NaCl, 1% SDS at 37°C, and a wash in 0.1X SSC at 60 to 65°C.

Specificity is typically the function of post-hybridization washes, the critical
factors being the ionic strength and temperature of the final wash solution. For DNA-DNA
hybrids, the Tm can be approximated from the equation of Meinkoth and Wahl, Anal.
Biochem., 138:267-284 (1984): Tm = 81.5 °C + 16.6 (log M) + 0.41 (%GC) - 0.61 (%
form) - 500/L; where M is the molarity of monovalent cations, %GC is the percentage of
guanosine and cytosine nucleotides in the DNA, % form is the percentage of formamide in
the hybridization solution, and L is the length of the hybrid in base pairs. The Tm is the
temperature (under defined ionic strength and pH) at which 50% of a complementary
target sequence hybridizes to a perfectly matched probe. Tm is reduced by about 1°C for
each 1% of mismatching; thus, Tm, hybridization and/or wash conditions can be adjusted
to hybridize to sequences of the desired identity. For example, if sequences with >90%
identity are sought, the Tm can be decreased 10°C. Generally, stringent conditions are
selected to be about 5°C lower than the thermal melting point (Tm) for the specific
sequence and its complement at a defined ionic strength and pH. However, severely
stringent conditions can utilize a hybridization and/or wash at 1, 2, 3, or 4 °C lower than
the thermal melting point (Tm); moderately stringent conditions can utilize a hybridization
and/or wash at 6, 7, 8, 9, or 10 °C lower than the thermal melting point (Tm); low
stringency conditions can utilize a hybridization and/or wash at 11, 12, 13, 14, 15, or 20 °C
lower than the thermal melting point (Tm). Using the equation, hybridization and wash
compositions, and desired Tm, those of ordinary skill will understand that variations in the
stringency of hybridization and/or wash solutions are inherently described. If the desired
degree of mismatching results in a Tm of less than 45 °C (aqueous solution) or 32 °C
(formamide solution) it is preferred to increase the SSC concentration so that a higher
temperature can be used. Hybridization and/or wash conditions can be applied for at least
10, 30, 60, 90, 120, or 240 minutes. An extensive guide to the hybridization of nucleic
acids is found in Tijssen, Laboratory Techniques in Biochemistry and Molecular Biology -
Hybridization with Nucleic Acid Probes, Part I, Chapter 2 "Overview of principles of
hybridization and the strategy of nucleic acid probe assays", Elsevier, New York (1993);
and Current Protocols in Molecular Biology, Chapter 2, Ausubel, et al., Eds., Greene
Publishing and Wiley-Interscience, New York (1995).

WO 00/70059 PCT/US00/11687

As used herein, "transgenic plant" includes reference to a plant which comprises
within its genome a heterologous polynucleotide. Generally, the heterologous
polynucleotide is stably integrated within the genome such that the polynucleotide is
passed on to successive generations. The heterologous polynucleotide may be integrated
into the genome alone or as part of a recombinant expression cassette. "Transgenic" is used
herein to include any cell, cell line, callus, tissue, plant part or plant, the genotype of which
has been altered by the presence of heterologous nucleic acid including those transgenics
initially so altered as well as those created by sexual crosses or asexual propagation from
the initial transgenic. The term "transgenic" as used herein does not encompass the
alteration of the genome (chromosomal or extra-chromosomal) by conventional plant
breeding methods or by naturally occurring events such as random cross-fertilization,
non-recombinant viral infection, non-recombinant bacterial transformation, non-recombinant
transposition, or spontaneous mutation.

As used herein, "vector" includes reference to a nucleic acid used in introduction of
a polynucleotide of the present invention into a host cell. Vectors are often replicons.
Expression vectors permit transcription of a nucleic acid inserted therein.

The following terms are used to describe the sequence relationships between a
polynucleotide/polypeptide of the present invention with a reference
polynucleotide/polypeptide: (a) "reference sequence", (b) "comparison window", (c)
"sequence identity", and (d) "percentage of sequence identity".

(a) As used herein, "reference sequence" is a defined sequence used as a basis for
sequence comparison with a polynucleotide/polypeptide of the present invention. A
reference sequence may be a subset or the entirety of a specified sequence; for example, as
a segment of a full-length cDNA or gene sequence, or the complete cDNA or gene
sequence.
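
As a purely illustrative aid, the Tm approximation of Meinkoth and Wahl recited above in connection with stringent hybridization conditions can be sketched in Python. The function and parameter names are hypothetical and form no part of the specification; the logarithm is taken as base 10, as is conventional for this formula, and the input values are arbitrary example numbers.

```python
import math


def melting_temperature(molarity, percent_gc, percent_formamide, length_bp):
    """Approximate Tm (in °C) of a DNA-DNA hybrid.

    Tm = 81.5 + 16.6*log10(M) + 0.41*(%GC) - 0.61*(% form) - 500/L
    """
    return (81.5
            + 16.6 * math.log10(molarity)
            + 0.41 * percent_gc
            - 0.61 * percent_formamide
            - 500.0 / length_bp)


def adjusted_tm(tm, percent_identity):
    """Lower Tm by about 1 °C for each 1% of mismatching."""
    return tm - (100.0 - percent_identity)


# Arbitrary example inputs: 1 M monovalent cation, 50% GC, 50% formamide, 500 bp hybrid.
tm = melting_temperature(1.0, 50.0, 50.0, 500)
print(f"Tm for a perfect match: {tm:.1f} C")
print(f"Tm when 90% identity is sought: {adjusted_tm(tm, 90.0):.1f} C")
```

A wash temperature could then be chosen a stated number of degrees below the adjusted value, following the ranges given for severely stringent, moderately stringent, and low stringency conditions.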

(b) As used herein, "comparison window" includes reference to a contiguous and
specified segment of a polynucleotide/polypeptide sequence, wherein the
polynucleotide/polypeptide sequence may be compared to a reference sequence and
wherein the portion of the polynucleotide/polypeptide sequence in the comparison window
may comprise additions or deletions (i.e., gaps) compared to the reference sequence
(which does not comprise additions or deletions) for optimal alignment of the two
sequences. Generally, the comparison window is at least 20 contiguous nucleotides/amino
acid residues in length, and optionally can be 30, 40, 50, 100, or longer. Those of skill in
the art understand that to avoid a high similarity to a reference sequence due to inclusion 
of gaps in the polynucleotide/polypeptide sequence, a gap penalty is typically introduced 
and is subtracted from the number of matches. 

Methods of alignment of sequences for comparison are well-known in the art.
Optimal alignment of sequences for comparison may be conducted by the local homology
algorithm of Smith and Waterman, Adv. Appl. Math. 2: 482 (1981); by the homology
alignment algorithm of Needleman and Wunsch, J. Mol. Biol. 48: 443 (1970); by the
search for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. 85: 2444
(1988); by computerized implementations of these algorithms, including, but not limited
to: CLUSTAL in the PC/Gene program by Intelligenetics, Mountain View, California;
GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics Software
Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, Wisconsin, USA;
the CLUSTAL program is well described by Higgins and Sharp, Gene 73: 237-244 (1988);
Higgins and Sharp, CABIOS 5: 151-153 (1989); Corpet, et al., Nucleic Acids Research
16: 10881-90 (1988); Huang, et al., Computer Applications in the Biosciences 8: 155-65
(1992), and Pearson, et al., Methods in Molecular Biology 24: 307-331 (1994).
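
For illustration only, the global alignment approach of Needleman and Wunsch, with a gap penalty subtracted from the score as discussed above for the comparison window, can be sketched as a short dynamic program. The scoring values (match, mismatch, gap) and the function name are assumptions chosen for this example; they are not the defaults of GAP or of any other program named above, which typically use more elaborate scoring.

```python
def global_alignment_score(seq_a, seq_b, match=1, mismatch=-1, gap=-2):
    """Optimal global alignment score with a linear gap penalty (illustrative values)."""
    rows, cols = len(seq_a) + 1, len(seq_b) + 1
    score = [[0] * cols for _ in range(rows)]
    # Aligning a prefix against nothing costs one gap penalty per residue.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap
    for i in range(1, rows):
        for j in range(1, cols):
            pair = match if seq_a[i - 1] == seq_b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + pair,  # align the two residues
                score[i - 1][j] + gap,       # gap in seq_b
                score[i][j - 1] + gap,       # gap in seq_a
            )
    return score[-1][-1]


print(global_alignment_score("GATTACA", "GATTACA"))
print(global_alignment_score("ACGT", "AGT"))
```

The second call shows the effect of the penalty: three matching positions and one gap give a lower score than three matches alone would.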

The BLAST family of programs which can be used for database similarity searches
includes: BLASTN for nucleotide query sequences against nucleotide database sequences;
BLASTX for nucleotide query sequences against protein database sequences; BLASTP for
protein query sequences against protein database sequences; TBLASTN for protein query
sequences against nucleotide database sequences; and TBLASTX for nucleotide query
sequences against nucleotide database sequences. See, Current Protocols in Molecular
Biology, Chapter 19, Ausubel, et al., Eds., Greene Publishing and Wiley-Interscience, New
York (1995); Altschul et al., J. Mol. Biol. 215:403-410 (1990); and, Altschul et al.,
Nucleic Acids Res. 25:3389-3402 (1997).

Software for performing BLAST analyses is publicly available, e.g., through the 
National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This 
algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying
short words of length W in the query sequence, which either match or satisfy some
positive-valued threshold score T when aligned with a word of the same length in a
database sequence. T is referred to as the neighborhood word score threshold. These
initial neighborhood word hits act as seeds for initiating searches to find longer HSPs
containing them. The word hits are then extended in both directions along each sequence
for as far as the cumulative alignment score can be increased. Cumulative scores are
calculated using, for nucleotide sequences, the parameters M (reward score for a pair of
matching residues; always > 0) and N (penalty score for mismatching residues; always <
0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score.
Extension of the word hits in each direction is halted when: the cumulative alignment
score falls off by the quantity X from its maximum achieved value; the cumulative score
goes to zero or below, due to the accumulation of one or more negative-scoring residue
alignments; or the end of either sequence is reached. The BLAST algorithm parameters
W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program
(for nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation (E) of
10, a cutoff of 100, M=5, N=-4, and a comparison of both strands. For amino acid
sequences, the BLASTP program uses as defaults a wordlength (W) of 3, an expectation
(E) of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff (1989) Proc.
Natl. Acad. Sci. USA 89:10915).
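
The extension step described above can be illustrated with a minimal Python sketch. It is not the BLAST implementation: it assumes an exact-match seed word (no neighborhood words or threshold T), extends in one direction only, and uses the hypothetical function name extend_right and an arbitrary default for X. The reward M=5 and penalty N=-4 are the BLASTN defaults recited above.

```python
def extend_right(query, subject, q_start, s_start, word_len,
                 match=5, mismatch=-4, x_drop=20):
    """Extend an exact seed word hit to the right without gaps.

    Returns (best_score, best_length) of the ungapped segment that starts
    at the seed. Extension halts when the running score falls x_drop below
    its maximum, when it reaches zero or below, or when either sequence ends.
    """
    score = best = word_len * match      # score of the exact seed word
    length = best_len = word_len
    qi, si = q_start + word_len, s_start + word_len
    while qi < len(query) and si < len(subject):
        score += match if query[qi] == subject[si] else mismatch
        length += 1
        if score > best:                 # remember the best-scoring prefix
            best, best_len = score, length
        if best - score >= x_drop or score <= 0:
            break                        # X-drop or non-positive score
        qi += 1
        si += 1
    return best, best_len
```

A full search would run the same loop leftward from the seed and combine the two results; that part is omitted here for brevity.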

In addition to calculating percent sequence identity, the BLAST algorithm also
performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin &
Altschul, Proc. Nat'l. Acad. Sci. USA 90:5873-5877 (1993)). One measure of similarity
provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides
an indication of the probability by which a match between two nucleotide or amino acid
sequences would occur by chance.

BLAST searches assume that proteins can be modeled as random sequences.
However, many real proteins comprise regions of nonrandom sequences which may be
homopolymeric tracts, short-period repeats, or regions enriched in one or more amino
acids. Such low-complexity regions may be aligned between unrelated proteins even
though other regions of the protein are entirely dissimilar. A number of low-complexity
filter programs can be employed to reduce such low-complexity alignments. For example,
the SEG (Wootton and Federhen, Comput. Chem., 17:149-163 (1993)) and XNU (Claverie
and States, Comput. Chem., 17:191-201 (1993)) low-complexity filters can be employed
alone or in combination.
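
To make the idea of low-complexity filtering concrete, the following Python sketch masks windows whose Shannon entropy falls below a cutoff. It is neither SEG nor XNU; the window size, the threshold, and the names shannon_entropy and mask_low_complexity are assumptions chosen only for illustration.

```python
import math
from collections import Counter


def shannon_entropy(window):
    """Shannon entropy (bits per residue) of the residues in a window."""
    counts = Counter(window)
    n = len(window)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def mask_low_complexity(seq, window=12, threshold=2.2, mask_char="X"):
    """Replace every window whose entropy is below threshold with mask_char.

    Homopolymeric tracts and short-period repeats have low entropy and are
    masked; windows with a varied composition are left untouched.
    """
    masked = list(seq)
    for i in range(len(seq) - window + 1):
        if shannon_entropy(seq[i:i + window]) < threshold:
            masked[i:i + window] = mask_char * window
    return "".join(masked)
```

Masked residues would then be ignored when seeding database searches, so that a homopolymeric tract alone cannot produce a reported match.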

Unless otherwise stated, nucleotide and protein identity/similarity values provided 
herein are calculated using GAP (GCG Version 10) under default values.

GAP (Global Alignment Program) can also be used to compare a polynucleotide or 
polypeptide of the present invention with a reference sequence. GAP uses the algorithm of 

Needleman and Wunsch (J. Mol. Biol. 48: 443-453, 1970) to find the alignment of two
complete sequences that maximizes the number of matches and minimizes the number of
gaps. GAP considers all possible alignments and gap positions and creates the alignment
with the largest number of matched bases and the fewest gaps. It allows for the provision
of a gap creation penalty and a gap extension penalty in units of matched bases. GAP
must make a profit of gap creation penalty number of matches for each gap it inserts. If a
gap extension penalty greater than zero is chosen, GAP must, in addition, make a profit for
each gap inserted of the length of the gap times the gap extension penalty. Default gap
creation penalty values and gap extension penalty values in Version 10 of the Wisconsin
Genetics Software Package for protein sequences are 8 and 2, respectively. For
nucleotide sequences the default gap creation penalty is 50 while the default gap extension
penalty is 3. The gap creation and gap extension penalties can be expressed as an integer
selected from the group of integers consisting of from 0 to 100. Thus, for example, the
gap creation and gap extension penalties can each independently be: 0, 1, 2, 3, 4, 5, 6, 7, 8,
9, 10, 15, 20, 30, 40, 50, 60 or greater.
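
The interplay of the gap creation and gap extension penalties can be sketched as a global alignment score computed by dynamic programming. The Python below is an illustration only, not the GCG GAP program: it charges a gap of length L a cost of creation + L x extension as described above, it penalizes end gaps like internal gaps (an assumption), and the match and mismatch values and the name global_align_score are hypothetical.

```python
NEG_INF = float("-inf")


def global_align_score(a, b, match=10, mismatch=0,
                       gap_create=50, gap_extend=3):
    """Best global alignment score of a and b with affine gap penalties."""
    n, m = len(a), len(b)
    # M: a[i-1] aligned to b[j-1]; X: a[i-1] opposite a gap; Y: b[j-1] opposite a gap
    M = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    X = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    Y = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    M[0][0] = 0
    for i in range(1, n + 1):
        X[i][0] = -(gap_create + i * gap_extend)
    for j in range(1, m + 1):
        Y[0][j] = -(gap_create + j * gap_extend)
    open_cost = gap_create + gap_extend   # cost of the first gapped position
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            M[i][j] = max(M[i - 1][j - 1], X[i - 1][j - 1], Y[i - 1][j - 1]) + s
            X[i][j] = max(M[i - 1][j] - open_cost,
                          X[i - 1][j] - gap_extend,
                          Y[i - 1][j] - open_cost)
            Y[i][j] = max(M[i][j - 1] - open_cost,
                          Y[i][j - 1] - gap_extend,
                          X[i][j - 1] - open_cost)
    return max(M[n][m], X[n][m], Y[n][m])
```

With match=10, gap_create=8 and gap_extend=2, aligning ACGT against AGT yields three matches (30) less one single-position gap (10), so a gap is inserted only when the matches it makes possible outweigh its cost.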

GAP presents one member of the family of best alignments. There may be many
members of this family, but no other member has a better quality. GAP displays four
figures of merit for alignments: Quality, Ratio, Identity, and Similarity. The Quality is the
metric maximized in order to align the sequences. Ratio is the quality divided by the
number of bases in the shorter segment. Percent Identity is the percent of the symbols that
actually match. Percent Similarity is the percent of the symbols that are similar. Symbols
that are across from gaps are ignored. A similarity is scored when the scoring matrix value
for a pair of symbols is greater than or equal to 0.50, the similarity threshold. The scoring
matrix used in Version 10 of the Wisconsin Genetics Software Package is BLOSUM62
(see Henikoff & Henikoff (1989) Proc. Natl. Acad. Sci. USA 89:10915).

Multiple alignment of the sequences can be performed using the CLUSTAL
method of alignment (Higgins and Sharp (1989) CABIOS 5:151-153) with the default
parameters (GAP PENALTY=10, GAP LENGTH PENALTY=10). Default parameters
for pairwise alignments using the CLUSTAL method are KTUPLE=1, GAP PENALTY=3,
WINDOW=5 and DIAGONALS SAVED=5.

(c) As used herein, "sequence identity" or "identity" in the context of two nucleic 
acid or polypeptide sequences includes reference to the residues in the two sequences 
which are the same when aligned for maximum correspondence over a specified 
comparison window. When percentage of sequence identity is used in reference to 
proteins it is recognized that residue positions which are not identical often differ by
conservative amino acid substitutions, where amino acid residues are substituted for other
amino acid residues with similar chemical properties (e.g., charge or hydrophobicity) and
therefore do not change the functional properties of the molecule. Where sequences differ
in conservative substitutions, the percent sequence identity may be adjusted upwards to
correct for the conservative nature of the substitution. Sequences which differ by such
conservative substitutions are said to have "sequence similarity" or "similarity". Means
for making this adjustment are well-known to those of skill in the art. Typically this
involves scoring a conservative substitution as a partial rather than a full mismatch,
thereby increasing the percentage sequence identity. Thus, for example, where an identical
amino acid is given a score of 1 and a non-conservative substitution is given a score of
zero, a conservative substitution is given a score between zero and 1. The scoring of
conservative substitutions is calculated, e.g., according to the algorithm of Meyers and
Miller, Computer Applic. Biol. Sci., 4: 11-17 (1988), e.g., as implemented in the program
PC/GENE (Intelligenetics, Mountain View, California, USA).

(d) As used herein, "percentage of sequence identity" means the value determined
by comparing two optimally aligned sequences over a comparison window, wherein the
portion of the polynucleotide sequence in the comparison window may comprise additions
or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise
additions or deletions) for optimal alignment of the two sequences. The percentage is
calculated by determining the number of positions at which the identical nucleic acid base
or amino acid residue occurs in both sequences to yield the number of matched positions,
dividing the number of matched positions by the total number of positions in the window
of comparison and multiplying the result by 100 to yield the percentage of sequence
identity.
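
Definitions (c) and (d) can be illustrated with a short Python sketch that operates on two already-aligned sequences of equal length, with "-" marking gaps. The conservative groups below are a hypothetical, simplified grouping chosen for illustration; they are not the Meyers and Miller scoring scheme, and the function names are likewise assumptions.

```python
# Hypothetical grouping of amino acids with similar side-chain chemistry.
CONSERVATIVE_GROUPS = [set("ILVM"), set("FYW"), set("KRH"),
                       set("DE"), set("ST"), set("NQ")]


def percent_identity(aligned_a, aligned_b):
    """Matched positions divided by all window positions, times 100."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned, equal length, non-empty")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b)
                  if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


def position_score(x, y, partial=0.5):
    """1 for identity, a partial score for a conservative change, else 0."""
    if x == y:
        return 1.0
    if any(x in g and y in g for g in CONSERVATIVE_GROUPS):
        return partial
    return 0.0


def percent_similarity(aligned_a, aligned_b, partial=0.5):
    """Identity adjusted upward by scoring conservative changes partially.

    Positions opposite a gap are ignored in this sketch.
    """
    pairs = [(x, y) for x, y in zip(aligned_a, aligned_b)
             if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    total = sum(position_score(x, y, partial) for x, y in pairs)
    return 100.0 * total / len(pairs)
```

For the aligned pair KLDE and RLDA, two of four positions are identical (50%), and counting the conservative K/R change as half a match raises the value to 62.5%.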

Utilities

The present invention provides, among other things, compositions and methods for 
modulating (i.e., increasing or decreasing) the level of polynucleotides and polypeptides of 
the present invention in plants. In particular, the polynucleotides and polypeptides of the
present invention can be expressed temporally or spatially, e.g., at developmental stages, in 
tissues, and/or in quantities, which are uncharacteristic of non-recombinantly engineered 
plants. 

The present invention also provides isolated nucleic acids comprising 
polynucleotides of sufficient length and complementarity to a polynucleotide of the present
invention to use as probes or amplification primers in the detection, quantitation, or
isolation of gene transcripts. For example, isolated nucleic acids of the present invention
can be used as probes in detecting deficiencies in the level of mRNA in screenings for
desired transgenic plants, for detecting mutations in the gene (e.g., substitutions, deletions,
or additions), for monitoring upregulation of expression or changes in enzyme activity in
screening assays of compounds, for detection of any number of allelic variants
(polymorphisms), orthologs, or paralogs of the gene, or for site directed mutagenesis in
eukaryotic cells (see, e.g., U.S. Patent No. 5,565,350). The isolated nucleic acids of the
present invention can also be used for recombinant expression of their encoded
polypeptides, or for use as immunogens in the preparation and/or screening of antibodies.
The isolated nucleic acids of the present invention can also be employed for use in sense or
antisense suppression of one or more genes of the present invention in a host cell, tissue, or
plant. Attachment of chemical agents which bind, intercalate, cleave and/or crosslink to
the isolated nucleic acids of the present invention can also be used to modulate
transcription or translation.

The present invention also provides isolated proteins comprising a polypeptide of 
the present invention (e.g., preproenzyme, proenzyme, or enzymes). The present invention 
also provides proteins comprising at least one epitope from a polypeptide of the present 
invention. The proteins of the present invention can be employed in assays for enzyme 

agonists or antagonists of enzyme function, or for use as immunogens or antigens to obtain
antibodies specifically immunoreactive with a protein of the present invention. Such 
antibodies can be used in assays for expression levels, for identifying and/or isolating 
nucleic acids of the present invention from expression libraries, for identification of 






WO 00/70059 PCT/US00/11687


homologous polypeptides from other species, or for purification of polypeptides of the
present invention. 

The isolated nucleic acids and polypeptides of the present invention can be used
over a broad range of plant types, particularly monocots such as the species of the family
Gramineae including Hordeum, Secale, Oryza, Triticum, Sorghum (e.g., S. bicolor) and
Zea (e.g., Z. mays), and dicots such as Glycine.

The isolated nucleic acid and proteins of the present invention can also be used in
species from the genera: Cucurbita, Rosa, Vitis, Juglans, Fragaria, Lotus, Medicago,
Onobrychis, Trifolium, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus,
Arabidopsis, Brassica, Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus,
Lycopersicon, Nicotiana, Solanum, Petunia, Digitalis, Majorana, Cichorium, Helianthus,
Lactuca, Bromus, Asparagus, Antirrhinum, Heterocallis, Nemesis, Pelargonium, Panicum,
Pennisetum, Ranunculus, Senecio, Salpiglossis, Cucumis, Browallia, Pisum, Phaseolus,
Lolium, and Avena.


Nucleic Acids 

The present invention provides, among other things, isolated nucleic acids of RNA, 
DNA, and analogs and/or chimeras thereof, comprising a polynucleotide of the present 
invention. 

A polynucleotide of the present invention is inclusive of those in the accompanying
Sequence Listing and:

(a) an isolated polynucleotide encoding a polypeptide of the present invention, such
as those polypeptides included in the accompanying Sequence Listing, including
exemplary polynucleotides of the present invention;

(b) an isolated polynucleotide which is the product of amplification from a plant
nucleic acid library using primer pairs which selectively hybridize under stringent
conditions to loci within a polynucleotide of the present invention;

(c) an isolated polynucleotide which selectively hybridizes to a polynucleotide of
(a) or (b);

(d) an isolated polynucleotide having a specified sequence identity with
polynucleotides of (a), (b), or (c);

(e) an isolated polynucleotide encoding a protein having a specified number of
contiguous amino acids from a prototype polypeptide, wherein the protein is specifically
recognized by antisera elicited by presentation of the protein and wherein the protein does
not detectably immunoreact to antisera which has been fully immunosorbed with the
protein;

(f) complementary sequences of polynucleotides of (a), (b), (c), (d), or (e); and

(g) an isolated polynucleotide comprising at least a specific number of contiguous
nucleotides from a polynucleotide of (a), (b), (c), (d), (e), or (f);

(h) an isolated polynucleotide from a full-length enriched cDNA library having the
physico-chemical property of selectively hybridizing to a polynucleotide of (a), (b), (c),
(d), (e), (f), or (g);

(i) an isolated polynucleotide made by the process of: 1) providing a full-length
enriched nucleic acid library, 2) selectively hybridizing the polynucleotide to a
polynucleotide of (a), (b), (c), (d), (e), (f), (g), or (h), thereby isolating the polynucleotide
from the nucleic acid library.



A. Polynucleotides Encoding A Polypeptide of the Present Invention 

As indicated in (a), above, the present invention provides isolated nucleic acids
comprising a polynucleotide of the present invention, wherein the polynucleotide encodes
a polypeptide of the present invention. Every nucleic acid sequence herein that encodes a
polypeptide also, by reference to the genetic code, describes every possible silent variation
of the nucleic acid. One of ordinary skill will recognize that each codon in a nucleic acid
(except AUG, which is ordinarily the only codon for methionine; and UGG, which is
ordinarily the only codon for tryptophan) can be modified to yield a functionally identical
molecule. Thus, each silent variation of a nucleic acid which encodes a polypeptide of the
present invention is implicit in each described polypeptide sequence and is within the
scope of the present invention. Accordingly, the present invention includes
polynucleotides of the present invention and polynucleotides encoding a polypeptide of
the present invention.
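
The notion of silent variation can be made concrete by enumerating synonymous codons. The Python sketch below builds the standard genetic code table and counts or lists the coding sequences that encode the same polypeptide. The function names are hypothetical, and the sketch assumes the standard code applies and that the input is an in-frame coding sequence written in DNA or RNA letters.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Map each amino acid (or stop, "*") to all codons that encode it.
SYNONYMS = {}
for codon, aa in CODON_TABLE.items():
    SYNONYMS.setdefault(aa, []).append(codon)


def _codons(cds):
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    return [cds[i:i + 3] for i in range(0, len(cds), 3)]


def count_silent_variants(cds):
    """Number of sequences (including cds itself) encoding the same protein."""
    total = 1
    for codon in _codons(cds):
        total *= len(SYNONYMS[CODON_TABLE[codon]])
    return total


def silent_variants(cds):
    """Yield every DNA sequence that encodes the same polypeptide as cds."""
    choices = [SYNONYMS[CODON_TABLE[codon]] for codon in _codons(cds)]
    for combo in product(*choices):
        yield "".join(combo)
```

Consistent with the text, a sequence made only of ATG and TGG codons has no silent variant other than itself, whereas ATG followed by an alanine codon has four.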

B. Polynucleotides Amplified from a Plant Nucleic Acid Library 

As indicated in (b), above, the present invention provides an isolated nucleic acid 
comprising a polynucleotide of the present invention, wherein the polynucleotides are
amplified, under nucleic acid amplification conditions, from a plant nucleic acid library.
Nucleic acid amplification conditions for each of the variety of amplification methods are
well known to those of ordinary skill in the art. The plant nucleic acid library can be
constructed from a monocot such as a cereal crop. Exemplary cereals include maize,
sorghum, alfalfa, canola, wheat, or rice. The plant nucleic acid library can also be
constructed from a dicot such as soybean. Zea mays lines B73, PHRE1, A632, BMS-
P2#10, W23, and Mo17 are known and publicly available. Other publicly known and
available maize lines can be obtained from the Maize Genetics Cooperation (Urbana, IL).
Wheat lines are available from the Wheat Genetics Resource Center (Manhattan, KS).

The nucleic acid library may be a cDNA library, a genomic library, or a library 
generally constructed from nuclear transcripts at any stage of intron processing. cDNA 
libraries can be normalized to increase the representation of relatively rare cDNAs. In
optional embodiments, the cDNA library is constructed using an enriched full-length
cDNA synthesis method. Examples of such methods include Oligo-Capping (Maruyama,
K. and Sugano, S. Gene 138: 171-174, 1994), Biotinylated CAP Trapper (Carninci, et al.
Genomics 37: 327-336, 1996), and CAP Retention Procedure (Edery, E., Chu, L.L., et al.
Molecular and Cellular Biology 15: 3363-3371, 1995). Rapidly growing tissues or rapidly
dividing cells are preferred for use as an mRNA source for construction of a cDNA library.
Growth stages of maize are described in "How a Corn Plant Develops," Special Report
No. 48, Iowa State University of Science and Technology Cooperative Extension Service,
Ames, Iowa, Reprinted February 1993.

A polynucleotide of this embodiment (or subsequences thereof) can be obtained, 
for example, by using amplification primers which are selectively hybridized and primer 

extended, under nucleic acid amplification conditions, to at least two sites within a
polynucleotide of the present invention, or to two sites within the nucleic acid which flank
and comprise a polynucleotide of the present invention, or to a site within a polynucleotide
of the present invention and a site within the nucleic acid which comprises it. Methods for
obtaining 5' and/or 3' ends of a vector insert are well known in the art. See, e.g., RACE
(Rapid Amplification of Complementary Ends) as described in Frohman, M. A., in PCR
Protocols: A Guide to Methods and Applications, M. A. Innis, D. H. Gelfand, J. J.
Sninsky, T. J. White, Eds. (Academic Press, Inc., San Diego), pp. 28-38 (1990); see also,
U.S. Pat. No. 5,470,722, and Current Protocols in Molecular Biology, Unit 15.6, Ausubel,
et al., Eds., Greene Publishing and Wiley-Interscience, New York (1995); Frohman and
Martin, Techniques 1:165 (1989).

Optionally, the primers are complementary to a subsequence of the target nucleic 
acid which they amplify but may have a sequence identity ranging from about 85% to 99%
relative to the polynucleotide sequence which they are designed to anneal to. As those
skilled in the art will appreciate, the sites to which the primer pairs will selectively
hybridize are chosen such that a single contiguous nucleic acid can be formed under the
desired nucleic acid amplification conditions. The primer length in nucleotides is selected
from the group of integers consisting of from at least 15 to 50. Thus, the primers can be at
least 15, 18, 20, 25, 30, 40, or 50 nucleotides in length. Those of skill will recognize that a
lengthened primer sequence can be employed to increase specificity of binding (i.e.,
annealing) to a target sequence. A non-annealing sequence at the 5' end of a primer (a
"tail") can be added, for example, to introduce a cloning site at the terminal ends of the
amplicon.

The amplification products can be translated using expression systems well known 
to those of skill in the art. The resulting translation products can be confirmed as
polypeptides of the present invention by, for example, assaying for the appropriate
catalytic activity (e.g., specific activity and/or substrate specificity), or verifying the
presence of one or more epitopes which are specific to a polypeptide of the present
invention. Methods for protein synthesis from PCR derived templates are known in the art
and available commercially. See, e.g., Amersham Life Sciences, Inc., Catalog '97, p. 354.

C. Polynucleotides Which Selectively Hybridize to a Polynucleotide of (A) or (B)

As indicated in (c), above, the present invention provides isolated nucleic acids 
comprising polynucleotides of the present invention, wherein the polynucleotides 

selectively hybridize, under selective hybridization conditions, to a polynucleotide of
sections (A) or (B) as discussed above. Thus, the polynucleotides of this embodiment can
be used for isolating, detecting, and/or quantifying nucleic acids comprising the
polynucleotides of (A) or (B). For example, polynucleotides of the present invention can
be used to identify, isolate, or amplify partial or full-length clones in a deposited library.
In some embodiments, the polynucleotides are genomic or cDNA sequences isolated or
otherwise complementary to a cDNA from a dicot or monocot nucleic acid library.
Exemplary species of monocots and dicots include, but are not limited to: maize, canola,
soybean, cotton, wheat, sorghum, sunflower, alfalfa, oats, sugar cane, millet, barley, and
rice. The cDNA library comprises at least 50% to 95% full-length sequences (for
example, at least 50%, 60%, 70%, 80%, 90%, or 95% full-length sequences). The cDNA
libraries can be normalized to increase the representation of rare sequences. See, e.g., U.S.
Patent No. 5,482,845. Low stringency hybridization conditions are typically, but not
exclusively, employed with sequences having a reduced sequence identity relative to
complementary sequences. Moderate and high stringency conditions can optionally be
employed for sequences of greater identity. Low stringency conditions allow selective
hybridization of sequences having about 70% to 80% sequence identity and can be
employed to identify orthologous or paralogous sequences.

D. Polynucleotides Having a Specific Sequence Identity with the Polynucleotides of (A),
(B) or (C)

As indicated in (d), above, the present invention provides isolated nucleic acids 
comprising polynucleotides of the present invention, wherein the polynucleotides have a 
specified identity at the nucleotide level to a polynucleotide as disclosed above in sections 

(A), (B), or (C), above. Identity can be calculated using, for example, the BLAST,
CLUSTALW, or GAP algorithms under default conditions. The percentage of identity to a
reference sequence is at least 50% and, rounded upwards to the nearest integer, can be
expressed as an integer selected from the group of integers consisting of from 50 to 99.
Thus, for example, the percentage of identity to a reference sequence can be at least 60%,
70%, 75%, 80%, 85%, 90%, or 95%.

Optionally, the polynucleotides of this embodiment will encode a polypeptide that 
will share an epitope with a polypeptide encoded by the polynucleotides of sections (A), 
(B), or (C). Thus, these polynucleotides encode a first polypeptide which elicits 
production of antisera comprising antibodies which are specifically reactive to a second 

polypeptide encoded by a polynucleotide of (A), (B), or (C). However, the first
polypeptide does not bind to antisera raised against itself when the antisera has been fully
immunosorbed with the first polypeptide. Hence, the polynucleotides of this embodiment
can be used to generate antibodies for use in, for example, the screening of expression
libraries for nucleic acids comprising polynucleotides of (A), (B), or (C), or for
purification of, or in immunoassays for, polypeptides encoded by the polynucleotides of
(A), (B), or (C). The polynucleotides of this embodiment comprise nucleic acid sequences
which can be employed for selective hybridization to a polynucleotide encoding a
polypeptide of the present invention.

Screening polypeptides for specific binding to antisera can be conveniently 

achieved using peptide display libraries. This method involves the screening of large
collections of peptides for individual members having the desired function or structure.
Antibody screening of peptide display libraries is well known in the art. The displayed
peptide sequences can be from 3 to 5000 or more amino acids in length, frequently from 5-
100 amino acids long, and often from about 8 to 15 amino acids long. In addition to direct
chemical synthetic methods for generating peptide libraries, several recombinant DNA
methods have been described. One type involves the display of a peptide sequence on the
surface of a bacteriophage or cell. Each bacteriophage or cell contains the nucleotide
sequence encoding the particular displayed peptide sequence. Such methods are described
in PCT patent publication Nos. 91/17271, 91/18980, 91/19818, and 93/08278. Other
systems for generating libraries of peptides have aspects of both in vitro chemical
synthesis and recombinant methods. See, PCT Patent publication Nos. 92/05258,
92/14843, and 97/20078. See also, U.S. Patent Nos. 5,658,754; and 5,643,768. Peptide
display libraries, vectors, and screening kits are commercially available from such
suppliers as Invitrogen (Carlsbad, CA).

E. Polynucleotides Encoding a Protein Having a Subsequence from a Prototype
Polypeptide and Cross-Reactive to the Prototype Polypeptide

As indicated in (e), above, the present invention provides isolated nucleic acids 

comprising polynucleotides of the present invention, wherein the polynucleotides encode a
protein having a subsequence of contiguous amino acids from a prototype polypeptide of
the present invention such as are provided in (a), above. The length of contiguous amino
acids from the prototype polypeptide is selected from the group of integers consisting of
from at least 10 to the number of amino acids within the prototype sequence. Thus, for
example, the polynucleotide can encode a polypeptide having a subsequence having at
least 10, 15, 20, 25, 30, 35, 40, 45, or 50 contiguous amino acids from the prototype
polypeptide. Further, the number of such subsequences encoded by a polynucleotide of
the instant embodiment can be any integer selected from the group consisting of from 1 to
20, such as 2, 3, 4, or 5. The subsequences can be separated by any integer of nucleotides
from 1 to the number of nucleotides in the sequence such as at least 5, 10, 15, 25, 50, 100,
or 200 nucleotides.

The proteins encoded by polynucleotides of this embodiment, when presented as an 
immunogen, elicit the production of polyclonal antibodies which specifically bind to a
prototype polypeptide such as but not limited to, a polypeptide encoded by the
polynucleotide of (a) or (b), above. Generally, however, a protein encoded by a
polynucleotide of this embodiment does not bind to antisera raised against the prototype
polypeptide when the antisera has been fully immunosorbed with the prototype
polypeptide. Methods of making and assaying for antibody binding specificity/affinity are
well known in the art. Exemplary immunoassay formats include ELISA, competitive
immunoassays, radioimmunoassays, Western blots, indirect immunofluorescent assays and
the like.

In a preferred assay method, fully immunosorbed and pooled antisera which is 
elicited to the prototype polypeptide can be used in a competitive binding assay to test the 
protein. The concentration of the prototype polypeptide required to inhibit 50% of the
binding of the antisera to the prototype polypeptide is determined. If the amount of the
protein required to inhibit binding is less than twice the amount of the prototype protein,
then the protein is said to specifically bind to the antisera elicited to the immunogen.
Accordingly, the proteins of the present invention embrace allelic variants, conservatively
modified variants, and minor recombinant modifications to a prototype polypeptide.
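
The decision rule in the competitive binding assay above reduces to comparing two 50%-inhibition amounts. The Python sketch below is illustrative only: it assumes dose-response data listed in order of increasing competitor concentration, estimates the 50% point by linear interpolation (an assumption; the text does not prescribe a fitting method), and uses hypothetical function names.

```python
def ic50(concentrations, percent_bound):
    """Interpolate the competitor concentration giving 50% residual binding.

    concentrations must be increasing; percent_bound is the antisera binding
    remaining at each concentration, as a percentage of uninhibited binding.
    """
    points = list(zip(concentrations, percent_bound))
    for (c0, b0), (c1, b1) in zip(points, points[1:]):
        if b0 >= 50.0 >= b1 and b0 != b1:
            return c0 + (b0 - 50.0) * (c1 - c0) / (b0 - b1)
    raise ValueError("binding never crosses 50% in the measured range")


def specifically_binds(ic50_test, ic50_prototype, max_ratio=2.0):
    """True if the test protein needs less than twice the prototype amount."""
    return ic50_test < max_ratio * ic50_prototype
```

For example, a test protein that reaches 50% inhibition at 3 units when the prototype needs 2 units satisfies the criterion, whereas one that needs 5 units does not.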

A polynucleotide of the present invention optionally encodes a protein having a 
molecular weight as the non-glycosylated protein within 20% of the molecular weight of 
the full-length non-glycosylated polypeptides of the present invention. Molecular weight 
can be readily determined by SDS-PAGE under reducing conditions. Optionally, the 

molecular weight is within 15% of a full length polypeptide of the present invention, more
preferably within 10% or 5%, and most preferably within 3%, 2%, or 1% of a full length
polypeptide of the present invention.

Optionally, the polynucleotides of this embodiment will encode a protein having a 
specific enzymatic activity at least 50%, 60%, 80%, or 90% of a cellular extract 

comprising the native, endogenous full-length polypeptide of the present invention.
Further, the proteins encoded by polynucleotides of this embodiment will optionally have a
substantially similar affinity constant (Km) and/or catalytic activity (i.e., the microscopic
rate constant, kcat) as the native endogenous, full-length protein. Those of skill in the art
will recognize that the kcat/Km value determines the specificity for competing substrates and is
often referred to as the specificity constant. Proteins of this embodiment can have a
kcat/Km value at least 10% of a full-length polypeptide of the present invention as
determined using the endogenous substrate of that polypeptide. Optionally, the kcat/Km
value will be at least 20%, 30%, 40%, 50%, and most preferably at least 60%, 70%, 80%,
90%, or 95% the kcat/Km value of the full-length polypeptide of the present invention.
The values of kcat, Km, and kcat/Km can be determined by any number of means well
known to those of skill in the art. For example, the initial rates (i.e., the first 5% or less of
the reaction) can be determined using rapid mixing and sampling techniques (e.g.,
continuous-flow, stopped-flow, or rapid quenching techniques), flash photolysis, or
relaxation methods (e.g., temperature jumps) in conjunction with such exemplary methods
of measuring as spectrophotometry, spectrofluorimetry, nuclear magnetic resonance, or
radioactive procedures. Kinetic values are conveniently obtained using a Lineweaver-
Burk or Eadie-Hofstee plot.
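
As an illustration of how kinetic values may be read from a Lineweaver-Burk plot, the Python sketch below fits a straight line to 1/v against 1/[S] by ordinary least squares and recovers Km and Vmax from the slope (Km/Vmax) and intercept (1/Vmax). It assumes Michaelis-Menten kinetics and initial-rate data; the function names and the unweighted fit are choices made for this sketch only.

```python
def lineweaver_burk(substrate, velocity):
    """Estimate (Km, Vmax) from initial rates via a double-reciprocal fit.

    Fits 1/v = (Km/Vmax) * (1/[S]) + 1/Vmax by ordinary least squares.
    """
    xs = [1.0 / s for s in substrate]
    ys = [1.0 / v for v in velocity]
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    vmax = 1.0 / intercept
    km = slope * vmax
    return km, vmax


def specificity_constant(vmax, km, enzyme_conc):
    """kcat/Km, where kcat = Vmax / total enzyme concentration."""
    kcat = vmax / enzyme_conc
    return kcat / km
```

The ratio of specificity constants for a variant and the full-length polypeptide can then be compared against the percentage thresholds recited above.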

F. Polynucleotides Complementary to the Polynucleotides of (A)-(E)

As indicated in (f), above, the present invention provides isolated nucleic acids 
comprising polynucleotides complementary to the polynucleotides of paragraphs A-E, 
above. As those of skill in the art will recognize, complementary sequences base-pair 
throughout the entirety of their length with the polynucleotides of sections (A)-(E) (i.e., 
have 100% sequence identity over their entire length). Complementary bases associate
through hydrogen bonding in double stranded nucleic acids. For example, the following
base pairs are complementary: guanine and cytosine; adenine and thymine; and adenine
and uracil.
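
The base-pairing rules just listed can be expressed as a short Python sketch that derives the complementary strand of a sequence. The function names are hypothetical, and only the four standard bases of DNA or RNA are handled.

```python
DNA_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")
RNA_COMPLEMENT = str.maketrans("ACGUacgu", "UGCAugca")


def reverse_complement(seq, rna=False):
    """Return the complementary strand, written 5' to 3'."""
    table = RNA_COMPLEMENT if rna else DNA_COMPLEMENT
    return seq.translate(table)[::-1]


def is_fully_complementary(a, b):
    """True if DNA strand b base-pairs with a over its entire length."""
    return reverse_complement(a) == b
```

Because the two strands of a duplex are antiparallel, the complement is reversed so that both sequences read in the 5' to 3' direction.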

G. Polynucleotides Which are Subsequences of the Polynucleotides of (A)-(F)

As indicated in (g), above, the present invention provides isolated nucleic acids 
comprising polynucleotides which comprise at least 15 contiguous bases from the 
polynucleotides of sections (A) through (F) as discussed above. The length of the 
polynucleotide is given as an integer selected from the group consisting of from at least 15 

to the length of the nucleic acid sequence from which the polynucleotide is a subsequence
of. Thus, for example, polynucleotides of the present invention are inclusive of
polynucleotides comprising at least 15, 20, 25, 30, 40, 50, 60, 75, or 100 contiguous
nucleotides in length from the polynucleotides of (A)-(F). Optionally, the number of such
subsequences encoded by a polynucleotide of the instant embodiment can be any integer
selected from the group consisting of from 1 to 20, such as 2, 3, 4, or 5. The subsequences
can be separated by any integer of nucleotides from 1 to the number of nucleotides in the
sequence such as at least 5, 10, 15, 25, 50, 100, or 200 nucleotides.

Subsequences can be made by in vitro synthetic, in vitro biosynthetic, or in vivo 
recombinant methods. In optional embodiments, subsequences can be made by nucleic 

acid amplification. For example, nucleic acid primers will be constructed to selectively
hybridize to a sequence (or its complement) within, or co-extensive with, the coding 
region. 

The subsequences of the present invention can comprise structural characteristics
of the sequence from which they are derived. Alternatively, the subsequences can lack certain
structural characteristics of the larger sequence from which they are derived, such as a poly (A)
tail. Optionally, a subsequence from a polynucleotide encoding a polypeptide having at
least one epitope in common with a prototype polypeptide sequence as provided in (a),
above, may encode an epitope in common with the prototype sequence. Alternatively, the
subsequence may not encode an epitope in common with the prototype sequence but can
be used to isolate the larger sequence by, for example, nucleic acid hybridization with the
sequence from which it is derived. Subsequences can be used to modulate or detect gene
expression by introducing into the subsequences compounds which bind, intercalate,
cleave and/or crosslink to nucleic acids. Exemplary compounds include acridine, psoralen,
phenanthroline, naphthoquinone, daunomycin or chloroethylaminoaryl conjugates.

H. Polynucleotides From a Full-length Enriched cDNA Library Having the Physico-Chemical
Property of Selectively Hybridizing to a Polynucleotide of (A)-(G)
As indicated in (h), above, the present invention provides an isolated
polynucleotide from a full-length enriched cDNA library having the physico-chemical
property of selectively hybridizing to a polynucleotide of paragraphs (A), (B), (C), (D),
(E), (F), or (G) as discussed above. Methods of constructing full-length enriched cDNA
libraries are known in the art and discussed briefly below. The cDNA library comprises at
least 50% to 95% full-length sequences (for example, at least 50%, 60%, 70%, 80%, 90%,
or 95% full-length sequences). The cDNA library can be constructed from a variety of
tissues from a monocot or dicot at a variety of developmental stages. Exemplary species
include maize, wheat, rice, canola, soybean, cotton, sorghum, sunflower, alfalfa, oats,
sugar cane, millet, and barley. Methods of selectively hybridizing, under selective
hybridization conditions, a polynucleotide from a full-length enriched library to a
polynucleotide of the present invention are known to those of ordinary skill in the art. Any
number of stringency conditions can be employed to allow for selective hybridization. In
optional embodiments, the stringency allows for selective hybridization of sequences
having at least 70%, 75%, 80%, 85%, 90%, 95%, or 98% sequence identity over the length
of the hybridized region. Full-length enriched cDNA libraries can be normalized to
increase the representation of rare sequences.
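
The sequence identity thresholds above can be made concrete with a minimal sketch that scores two sequences over a hybridized region. It assumes the two strings are already aligned and of equal length, which is a simplification: it does not perform alignment and does not model hybridization stringency itself. The function names are hypothetical.

```python
# Illustrative sketch only: assumes pre-aligned, equal-length, gap-free sequences.
def percent_identity(aligned_a, aligned_b):
    """Return the percentage of positions at which the two sequences are identical."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(1 for a, b in zip(aligned_a.upper(), aligned_b.upper()) if a == b)
    return 100.0 * matches / len(aligned_a)


def meets_identity_threshold(aligned_a, aligned_b, threshold=70.0):
    """Return True when identity over the compared region is at least threshold percent."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

The default threshold of 70% is simply the lowest value listed in the text; any of the other listed values (75% through 98%) could be passed instead.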

I. Polynucleotide Products Made by a cDNA Isolation Process

As indicated in (I), above, the present invention provides an isolated polynucleotide
made by the process of: 1) providing a full-length enriched nucleic acid library, 2)
selectively hybridizing the polynucleotide to a polynucleotide of paragraphs (A), (B), (C),
(D), (E), (F), (G), or (H) as discussed above, and thereby isolating the polynucleotide from
the nucleic acid library. Full-length enriched nucleic acid libraries are constructed as
discussed in paragraph (G) and below. Selective hybridization conditions are as discussed
in paragraph (G). Nucleic acid purification procedures are well known in the art.
Purification can be conveniently accomplished using solid-phase methods; such methods
are well known to those of skill in the art and kits are available from commercial suppliers
such as Advanced Biotechnologies (Surrey, UK). For example, a polynucleotide of
paragraphs (A)-(H) can be immobilized to a solid support such as a membrane, bead, or
particle. See, e.g., U.S. Patent No. 5,667,976. The polynucleotide product of the present
process is selectively hybridized to an immobilized polynucleotide and the solid support is
subsequently isolated from non-hybridized polynucleotides by methods including, but not
limited to, centrifugation, magnetic separation, filtration, electrophoresis, and the like.

Construction of Nucleic Acids

The isolated nucleic acids of the present invention can be made using (a) standard
recombinant methods, (b) synthetic techniques, or combinations thereof. In some
embodiments, the polynucleotides of the present invention will be cloned, amplified, or
otherwise constructed from a monocot such as maize, rice, or wheat, or a dicot such as
soybean.

The nucleic acids may conveniently comprise sequences in addition to a
polynucleotide of the present invention. For example, a multi-cloning site comprising one
or more endonuclease restriction sites may be inserted into the nucleic acid to aid in
isolation of the polynucleotide. Also, translatable sequences may be inserted to aid in the
isolation of the translated polynucleotide of the present invention. For example, a hexa-histidine
marker sequence provides a convenient means to purify the proteins of the
present invention. A polynucleotide of the present invention can be attached to a vector,
adapter, or linker for cloning and/or expression of a polynucleotide of the present
invention. Additional sequences may be added to such cloning and/or expression
sequences to optimize their function in cloning and/or expression, to aid in isolation of the
polynucleotide, or to improve the introduction of the polynucleotide into a cell. Typically,
the length of a nucleic acid of the present invention less the length of its polynucleotide of
the present invention is less than 20 kilobase pairs, often less than 15 kb, and frequently
less than 10 kb. Use of cloning vectors, expression vectors, adapters, and linkers is well
known and extensively described in the art. For a description of various nucleic acids see,
for example, Stratagene Cloning Systems, Catalogs 1999 (La Jolla, CA); and, Amersham
Life Sciences, Inc., Catalog '99 (Arlington Heights, IL).

A. Recombinant Methods for Constructing Nucleic Acids

The isolated nucleic acid compositions of this invention, such as RNA, cDNA,
genomic DNA, or a hybrid thereof, can be obtained from plant biological sources using
any number of cloning methodologies known to those of skill in the art. In some
embodiments, oligonucleotide probes which selectively hybridize, under stringent
conditions, to the polynucleotides of the present invention are used to identify the desired
sequence in a cDNA or genomic DNA library. Isolation of RNA, and construction of
cDNA and genomic libraries, is well known to those of ordinary skill in the art. See, e.g.,
Plant Molecular Biology: A Laboratory Manual, Clark, Ed., Springer-Verlag, Berlin
(1997); and, Current Protocols in Molecular Biology, Ausubel, et al., Eds., Greene
Publishing and Wiley-Interscience, New York (1995).

A1. Full-length Enriched cDNA Libraries

A number of cDNA synthesis protocols have been described which provide
enriched full-length cDNA libraries. Enriched full-length cDNA libraries are constructed
to comprise at least 60%, and more preferably at least 70%, 80%, 90% or 95% full-length
inserts amongst clones containing inserts. The length of insert in such libraries can be at
least 2, 3, 4, 5, 6, 7, 8, 9, 10 or more kilobase pairs. Vectors to accommodate inserts of
these sizes are known in the art and available commercially. See, e.g., Stratagene's lambda
ZAP Express (cDNA cloning vector with 0 to 12 kb cloning capacity). An exemplary
method of constructing a greater than 95% pure full-length cDNA library is described by
Carninci et al., Genomics, 37:327-336 (1996). Other methods for producing full-length
libraries are known in the art. See, e.g., Edery et al., Mol. Cell Biol., 15(6):3363-3371
(1995); and, PCT Application WO 96/34981.

A2. Normalized or Subtracted cDNA Libraries

A non-normalized cDNA library represents the mRNA population of the tissue it
was made from. Since unique clones are out-numbered by clones derived from highly
expressed genes, their isolation can be laborious. Normalization of a cDNA library is the
process of creating a library in which each clone is more equally represented.
Construction of normalized libraries is described in Ko, Nucl. Acids Res., 18(19):5705-5711
(1990); Patanjali et al., Proc. Natl. Acad. Sci. U.S.A., 88:1943-1947 (1991); U.S. Patents
5,482,685, 5,482,845, and 5,637,685. In an exemplary method described by Soares et al.,
normalization resulted in reduction of the abundance of clones from a range of four orders
of magnitude to a narrow range of only 1 order of magnitude. Proc. Natl. Acad. Sci. USA,
91:9228-9232 (1994).
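
To make the "orders of magnitude" comparison concrete, the following minimal sketch computes the abundance range of a library from per-clone counts. The function name and the counts in the usage note are hypothetical and do not come from the cited work.

```python
# Illustrative sketch only: clone counts are hypothetical, not data from the cited method.
import math


def abundance_range_orders(clone_counts):
    """Return log10(max/min) over the positive clone counts.

    A value of 4 means the most abundant clone is 10,000 times more frequent
    than the rarest one; a value of 1 means a 10-fold spread.
    """
    positive = [count for count in clone_counts if count > 0]
    if not positive:
        raise ValueError("at least one positive clone count is required")
    return math.log10(max(positive) / min(positive))
```

With hypothetical counts of 1, 10, and 10,000 the range is four orders of magnitude; counts of 5 and 50 give a range of one order, the kind of narrowing that normalization is described as achieving.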

Subtracted cDNA libraries are another means to increase the proportion of less
abundant cDNA species. In this procedure, cDNA prepared from one pool of mRNA is
depleted of sequences present in a second pool of mRNA by hybridization. The
cDNA:mRNA hybrids are removed and the remaining un-hybridized cDNA pool is
enriched for sequences unique to that pool. See, Foote et al. in, Plant Molecular Biology:
A Laboratory Manual, Clark, Ed., Springer-Verlag, Berlin (1997); Kho and Zarbl,
Technique, 3(2):58-63 (1991); Sive and St. John, Nucl. Acids Res., 16(22):10937 (1988);
Current Protocols in Molecular Biology, Ausubel, et al., Eds., Greene Publishing and
Wiley-Interscience, New York (1995); and, Swaroop et al., Nucl. Acids Res., 19(8):1954
(1991). cDNA subtraction kits are commercially available. See, e.g., PCR-Select
(Clontech, Palo Alto, CA).

To construct genomic libraries, large segments of genomic DNA are generated by
fragmentation, e.g. using restriction endonucleases, and are ligated with vector DNA to
form concatemers that can be packaged into the appropriate vector. Methodologies to
accomplish these ends, and sequencing methods to verify the sequence of nucleic acids, are
well known in the art. Examples of appropriate molecular biological techniques and
instructions sufficient to direct persons of skill through many construction, cloning, and
screening methodologies are found in Sambrook, et al., Molecular Cloning: A Laboratory
Manual, 2nd Ed., Cold Spring Harbor Laboratory, Vols. 1-3 (1989); Methods in
Enzymology, Vol. 152: Guide to Molecular Cloning Techniques, Berger and Kimmel,
Eds., San Diego: Academic Press, Inc. (1987); Current Protocols in Molecular Biology,
Ausubel, et al., Eds., Greene Publishing and Wiley-Interscience, New York (1995); Plant
Molecular Biology: A Laboratory Manual, Clark, Ed., Springer-Verlag, Berlin (1997).
Kits for construction of genomic libraries are also commercially available.

The cDNA or genomic library can be screened using a probe based upon the
sequence of a polynucleotide of the present invention such as those disclosed herein.
Probes may be used to hybridize with genomic DNA or cDNA sequences to isolate
homologous genes in the same or different plant species. Those of skill in the art will
appreciate that various degrees of stringency of hybridization can be employed in the
assay; and either the hybridization or the wash medium can be stringent.

The nucleic acids of interest can also be amplified from nucleic acid samples using
amplification techniques. For instance, polymerase chain reaction (PCR) technology can
be used to amplify the sequences of polynucleotides of the present invention and related
genes directly from genomic DNA or cDNA libraries. PCR and other in vitro
amplification methods may also be useful, for example, to clone nucleic acid sequences
that code for proteins to be expressed, to make nucleic acids to use as probes for detecting
the presence of the desired mRNA in samples, for nucleic acid sequencing, or for other
purposes. The T4 gene 32 protein (Boehringer Mannheim) can be used to improve yield of
long PCR products.

PCR-based screening methods have been described. Wilfinger et al. describe a
PCR-based method in which the longest cDNA is identified in the first step so that
incomplete clones can be eliminated from study. BioTechniques, 22(3): 481-486 (1997).
Such methods are particularly effective in combination with a full-length cDNA
construction methodology, above.

B. Synthetic Methods for Constructing Nucleic Acids 

The isolated nucleic acids of the present invention can also be prepared by direct
chemical synthesis by methods such as the phosphotriester method of Narang et al., Meth.
Enzymol. 68: 90-99 (1979); the phosphodiester method of Brown et al., Meth. Enzymol.
68: 109-151 (1979); the diethylphosphoramidite method of Beaucage et al., Tetra. Lett. 22:
1859-1862 (1981); the solid phase phosphoramidite triester method described by Beaucage
and Caruthers, Tetra. Letts. 22(20): 1859-1862 (1981), e.g., using an automated
synthesizer, e.g., as described in Needham-VanDevanter et al., Nucleic Acids Res., 12:
6159-6168 (1984); and, the solid support method of U.S. Patent No. 4,458,066. Chemical
synthesis generally produces a single stranded oligonucleotide. This may be converted
into double stranded DNA by hybridization with a complementary sequence, or by
polymerization with a DNA polymerase using the single strand as a template. One of skill
will recognize that while chemical synthesis of DNA is best employed for sequences of
about 100 bases or less, longer sequences may be obtained by the ligation of shorter
sequences.
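
The idea of building a long sequence from chemically synthesized pieces of about 100 bases or less can be sketched as follows. This is a planning aid only, under the assumption that the pieces are simply adjacent segments of the target; it ignores the complementary strands and overlapping ends that a practical ligation design would need. The function name is hypothetical.

```python
# Illustrative sketch only: splits a target sequence into adjacent segments
# no longer than max_len; real ligation designs also need matching ends.
def split_into_oligos(seq, max_len=100):
    """Return consecutive segments of seq, each at most max_len bases long."""
    if max_len < 1:
        raise ValueError("max_len must be at least 1")
    return [seq[i:i + max_len] for i in range(0, len(seq), max_len)]
```

For a hypothetical 240-base target, this gives segments of 100, 100, and 40 bases, and joining the segments in order restores the original sequence.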

Recombinant Expression Cassettes

The present invention further provides recombinant expression cassettes
comprising a nucleic acid of the present invention. A nucleic acid sequence coding for the
desired polypeptide of the present invention, for example a cDNA or a genomic sequence
encoding a full length polypeptide of the present invention, can be used to construct a
recombinant expression cassette which can be introduced into the desired host cell. A
recombinant expression cassette will typically comprise a polynucleotide of the present
invention operably linked to transcriptional initiation regulatory sequences which will
direct the transcription of the polynucleotide in the intended host cell, such as tissues of a
transformed plant.

For example, plant expression vectors may include (1) a cloned plant gene under
the transcriptional control of 5' and 3' regulatory sequences and (2) a dominant selectable
marker. Such plant expression vectors may also contain, if desired, a promoter regulatory
region (e.g., one conferring inducible or constitutive, environmentally- or
developmentally-regulated, or cell- or tissue-specific/selective expression), a transcription
initiation start site, a ribosome binding site, an RNA processing signal, a transcription
termination site, and/or a polyadenylation signal.

A plant promoter fragment can be employed which will direct expression of a
polynucleotide of the present invention in all tissues of a regenerated plant. Such
promoters are referred to herein as "constitutive" promoters and are active under most
environmental conditions and states of development or cell differentiation. Examples of
constitutive promoters include the cauliflower mosaic virus (CaMV) 35S transcription
initiation region, the 1'- or 2'- promoter derived from T-DNA of Agrobacterium
tumefaciens, the ubiquitin 1 promoter, the Smas promoter, the cinnamyl alcohol
dehydrogenase promoter (U.S. Patent No. 5,683,439), the Nos promoter, the pEmu
promoter, the rubisco promoter, and the GRP1-8 promoter.

Alternatively, the plant promoter can direct expression of a polynucleotide of the
present invention in a specific tissue or may be otherwise under more precise
environmental or developmental control. Such promoters are referred to here as
"inducible" promoters. Environmental conditions that may effect transcription by
inducible promoters include pathogen attack, anaerobic conditions, or the presence of
light. Examples of inducible promoters are the Adh1 promoter which is inducible by
hypoxia or cold stress, the Hsp70 promoter which is inducible by heat stress, and the
PPDK promoter which is inducible by light.

Examples of promoters under developmental control include promoters that initiate
transcription only, or preferentially, in certain tissues, such as leaves, roots, fruit, seeds, or
flowers. Exemplary promoters include the anther specific promoter 5126 (U.S. Patent Nos.
5,689,049 and 5,689,051), glob-1 promoter, and gamma-zein promoter. See also US
patent applications 60/163,114, filed November 2, 1999, and 60/155,859, filed September
24, 1999. The operation of a promoter may also vary depending on its location in the
genome. Thus, an inducible promoter may become fully or partially constitutive in certain
locations.

Both heterologous and non-heterologous (i.e., endogenous) promoters can be
employed to direct expression of the nucleic acids of the present invention. These
promoters can also be used, for example, in recombinant expression cassettes to drive
expression of antisense nucleic acids to reduce, increase, or alter concentration and/or
composition of the proteins of the present invention in a desired tissue. Thus, in some
embodiments, the nucleic acid construct will comprise a promoter, functional in a plant
cell, operably linked to a polynucleotide of the present invention. Promoters useful in
these embodiments include the endogenous promoters driving expression of a polypeptide
of the present invention.

In some embodiments, isolated nucleic acids which serve as promoter or enhancer
elements can be introduced in the appropriate position (generally upstream) of a
non-heterologous form of a polynucleotide of the present invention so as to up or down
regulate expression of a polynucleotide of the present invention. For example, endogenous
promoters can be altered in vivo by mutation, deletion, and/or substitution (see, Kmiec,
U.S. Patent 5,565,350; Zarling et al., WO 93/22443), or isolated promoters can be
introduced into a plant cell in the proper orientation and distance from a cognate gene of a
polynucleotide of the present invention so as to control the expression of the gene. Gene
expression can be modulated under conditions suitable for plant growth so as to alter the
total concentration and/or alter the composition of the polypeptides of the present
invention in a plant cell. Thus, the present invention provides compositions, and methods for
making, heterologous promoters and/or enhancers operably linked to a native, endogenous
(i.e., non-heterologous) form of a polynucleotide of the present invention.

If polypeptide expression is desired, it is generally desirable to include a
polyadenylation region at the 3'-end of a polynucleotide coding region. The
polyadenylation region can be derived from the natural gene, from a variety of other plant
genes, or from T-DNA. The 3' end sequence to be added can be derived from, for
example, the nopaline synthase or octopine synthase genes, or alternatively from another
plant gene, or less preferably from any other eukaryotic gene.

An intron sequence can be added to the 5' untranslated region or the coding
sequence of the partial coding sequence to increase the amount of the mature message that
accumulates in the cytosol. Inclusion of a spliceable intron in the transcription unit in both
plant and animal expression constructs has been shown to increase gene expression at both
the mRNA and protein levels up to 1000-fold. Buchman and Berg, Mol. Cell Biol. 8: 4395-
4405 (1988); Callis et al., Genes Dev. 1: 1183-1200 (1987). Such intron enhancement of
gene expression is typically greatest when the intron is placed near the 5' end of the
transcription unit. Use of maize introns Adh1-S intron 1, 2, and 6, and the Bronze-1 intron
is known in the art. See generally, The Maize Handbook, Chapter 116, Freeling and Walbot,
Eds., Springer, New York (1994). The vector comprising the sequences from a polynucleotide
of the present invention will typically comprise a marker gene which confers a selectable
phenotype on plant cells. Typical vectors useful for expression of genes in higher plants
are well known in the art and include vectors derived from the tumor-inducing (Ti)
plasmid of Agrobacterium tumefaciens described by Rogers et al., Meth. in Enzymol.,
153:253-277 (1987).

A polynucleotide of the present invention can be expressed in either sense or anti-sense
orientation as desired. It will be appreciated that control of gene expression in either
sense or anti-sense orientation can have a direct impact on the observable plant
characteristics. Antisense technology can be conveniently used to inhibit gene expression
in plants. To accomplish this, a nucleic acid segment from the desired gene is cloned and
operably linked to a promoter such that the anti-sense strand of RNA will be transcribed.
The construct is then transformed into plants and the antisense strand of RNA is produced.
In plant cells, it has been shown that antisense RNA inhibits gene expression by preventing
the accumulation of mRNA which encodes the enzyme of interest, see, e.g., Sheehy et al.,
Proc. Nat'l. Acad. Sci. (USA) 85: 8805-8809 (1988); and Hiatt et al., U.S. Patent No.
4,801,340.

Another method of suppression is sense suppression (i.e., co-suppression).
Introduction of nucleic acid configured in the sense orientation has been shown to be an
effective means by which to block the transcription of target genes. For an example of the
use of this method to modulate expression of endogenous genes see, Napoli et al., The
Plant Cell 2: 279-289 (1990) and U.S. Patent No. 5,034,323.

Catalytic RNA molecules or ribozymes can also be used to inhibit expression of
plant genes. It is possible to design ribozymes that specifically pair with virtually any
target RNA and cleave the phosphodiester backbone at a specific location, thereby
functionally inactivating the target RNA. In carrying out this cleavage, the ribozyme is not
itself altered, and is thus capable of recycling and cleaving other molecules, making it a
true enzyme. The inclusion of ribozyme sequences within antisense RNAs confers
RNA-cleaving activity upon them, thereby increasing the activity of the constructs. The design
and use of target RNA-specific ribozymes is described in Haseloff et al., Nature 334: 585-
591 (1988).

A variety of cross-linking agents, alkylating agents and radical generating species
as pendant groups on polynucleotides of the present invention can be used to bind, label,
detect, and/or cleave nucleic acids. For example, Vlassov, V. V., et al., Nucleic Acids Res
(1986) 14:4065-4076, describe covalent bonding of a single-stranded DNA fragment with
alkylating derivatives of nucleotides complementary to target sequences. A report of
similar work by the same group is that by Knorre, D. G., et al., Biochimie (1985) 67:785-
789. Iverson and Dervan also showed sequence-specific cleavage of single-stranded DNA
mediated by incorporation of a modified nucleotide which was capable of activating
cleavage (J Am Chem Soc (1987) 109:1241-1243). Meyer, R. B., et al., J Am Chem Soc
(1989) 111:8517-8519, effect covalent crosslinking to a target nucleotide using an
alkylating agent complementary to the single-stranded target nucleotide sequence. A
photoactivated crosslinking to single-stranded oligonucleotides mediated by psoralen was
disclosed by Lee, B. L., et al., Biochemistry (1988) 27:3197-3203. Use of crosslinking in
triple-helix forming probes was also disclosed by Horne, et al., J Am Chem Soc (1990)
112:2435-2437. Use of N4, N4-ethanocytosine as an alkylating agent to crosslink to
single-stranded oligonucleotides has also been described by Webb and Matteucci, J Am
Chem Soc (1986) 108:2764-2765; Nucleic Acids Res (1986) 14:7661-7674; Feteritz et al.,
J. Am. Chem. Soc. 113:4000 (1991). Various compounds to bind, detect, label, and/or
cleave nucleic acids are known in the art. See, for example, U.S. Patent Nos. 5,543,507;
5,672,593; 5,484,908; 5,256,648; and, 5,681,941.

Proteins 

The isolated proteins of the present invention comprise a polypeptide having at
least 10 amino acids from a polypeptide of the present invention (or conservative variants
thereof) such as those encoded by any one of the polynucleotides of the present invention
as discussed more fully above. The proteins of the present invention or variants thereof
can comprise any number of contiguous amino acid residues from a polypeptide of the
present invention, wherein that number is selected from the group of integers consisting of
from 10 to the number of residues in a full-length polypeptide of the present invention.
Optionally, this subsequence of contiguous amino acids is at least 15, 20, 25, 30, 35, or 40
amino acids in length, often at least 50, 60, 70, 80, or 90 amino acids in length. Further,
the number of such subsequences can be any integer selected from the group consisting of
from 1 to 20, such as 2, 3, 4, or 5.

The present invention further provides a protein comprising a polypeptide having a
specified sequence identity/similarity with a polypeptide of the present invention. The
percentage of sequence identity/similarity is an integer selected from the group consisting
of from 50 to 99. Exemplary sequence identity/similarity values include 55%, 60%, 65%,
70%, 75%, 80%, 85%, 90%, and 95%. Sequence identity can be determined using, for
example, the GAP, CLUSTALW, or BLAST algorithms.

As those of skill will appreciate, the present invention includes, but is not limited
to, catalytically active polypeptides of the present invention (i.e., enzymes). Catalytically
active polypeptides have a specific activity of at least 20%, 30%, or 40%, and preferably at
least 50%, 60%, or 70%, and most preferably at least 80%, 90%, or 95% that of the native
(non-synthetic), endogenous polypeptide. Further, the substrate specificity (kcat/Km) is
optionally substantially similar to the native (non-synthetic), endogenous polypeptide.
Typically, the Km will be at least 30%, 40%, or 50% that of the native (non-synthetic),
endogenous polypeptide; and more preferably at least 60%, 70%, 80%, or 90%. Methods
of assaying and quantifying measures of enzymatic activity and substrate specificity
(kcat/Km) are well known to those of skill in the art.
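
The percentage comparisons above can be sketched as simple ratios. The function names and the kinetic values in the usage note are hypothetical; kcat is assumed to be expressed per second and Km in molar units, and the sketch does not represent any particular assay.

```python
# Illustrative sketch only: values passed in are assumed to be measured elsewhere.
def specificity_constant(kcat, km):
    """Return kcat/Km, the substrate specificity measure named in the text."""
    if km <= 0:
        raise ValueError("Km must be positive")
    return kcat / km


def relative_percent(variant_value, native_value):
    """Express a variant's value as a percentage of the native polypeptide's value."""
    if native_value <= 0:
        raise ValueError("native value must be positive")
    return 100.0 * variant_value / native_value
```

For a hypothetical native enzyme with kcat = 100 and Km = 2, and a variant with kcat = 40 and the same Km, the variant's kcat/Km is 40% of the native value. The same `relative_percent` helper applies to specific activity or to Km alone.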

Generally, the proteins of the present invention will, when presented as an
immunogen, elicit production of an antibody specifically reactive to a polypeptide of the
present invention. Further, the proteins of the present invention will not bind to antisera
raised against a polypeptide of the present invention which has been fully immunosorbed
with the same polypeptide. Immunoassays for determining binding are well known to
those of skill in the art. A preferred immunoassay is a competitive immunoassay. Thus,
the proteins of the present invention can be employed as immunogens for constructing
antibodies immunoreactive to a protein of the present invention for such exemplary
utilities as immunoassays or protein purification techniques.

Expression of Proteins in Host Cells

Using the nucleic acids of the present invention, one may express a protein of the
present invention in a recombinantly engineered cell such as bacteria, yeast, insect,
mammalian, or preferably plant cells. The cells produce the protein in a non-natural
condition (e.g., in quantity, composition, location, and/or time), because they have been
genetically altered through human intervention to do so.

It is expected that those of skill in the art are knowledgeable in the numerous
expression systems available for expression of a nucleic acid encoding a protein of the
present invention. No attempt to describe in detail the various methods known for the
expression of proteins in prokaryotes or eukaryotes will be made.

In brief summary, the expression of isolated nucleic acids encoding a protein of the
present invention will typically be achieved by operably linking, for example, the DNA or
cDNA to a promoter (which is either constitutive or regulatable), followed by
incorporation into an expression vector. The vectors can be suitable for replication and
integration in either prokaryotes or eukaryotes. Typical expression vectors contain
transcription and translation terminators, initiation sequences, and promoters useful for
regulation of the expression of the DNA encoding a protein of the present invention. To
obtain high level expression of a cloned gene, it is desirable to construct expression vectors
which contain, at the minimum, a strong promoter to direct transcription, a ribosome
binding site for translational initiation, and a transcription/translation terminator. One of
skill would recognize that modifications can be made to a protein of the present invention
without diminishing its biological activity. Some modifications may be made to facilitate
the cloning, expression, or incorporation of the targeting molecule into a fusion protein.
Such modifications are well known to those of skill in the art and include, for example, a
methionine added at the amino terminus to provide an initiation site, or additional amino
acids (e.g., poly His) placed on either terminus to create conveniently located purification
sequences. Restriction sites or termination codons can also be introduced.

Synthesis of Proteins 

The proteins of the present invention can be constructed using non-cellular
synthetic methods. Solid phase synthesis of proteins of less than about 50 amino acids in
length may be accomplished by attaching the C-terminal amino acid of the sequence to an
insoluble support followed by sequential addition of the remaining amino acids in the
sequence. Techniques for solid phase synthesis are described by Barany and Merrifield,
Solid-Phase Peptide Synthesis, pp. 3-284 in The Peptides: Analysis, Synthesis, Biology,
Vol. 2: Special Methods in Peptide Synthesis, Part A.; Merrifield, et al., J. Am. Chem. Soc.
85: 2149-2156 (1963), and Stewart et al., Solid Phase Peptide Synthesis, 2nd ed., Pierce
Chem. Co., Rockford, Ill. (1984). Proteins of greater length may be synthesized by
condensation of the amino and carboxy termini of shorter fragments. Methods of forming
peptide bonds by activation of a carboxy terminal end (e.g., by the use of the coupling
reagent N,N'-dicyclohexylcarbodiimide) are known to those of skill.

Purification of Proteins 

The proteins of the present invention may be purified by standard techniques well
known to those of skill in the art. Recombinantly produced proteins of the present
invention can be directly expressed or expressed as a fusion protein. The recombinant
protein is purified by a combination of cell lysis (e.g., sonication, French press) and
affinity chromatography. For fusion products, subsequent digestion of the fusion protein
with an appropriate proteolytic enzyme releases the desired recombinant protein.

The proteins of this invention, recombinant or synthetic, may be purified to
substantial purity by standard techniques well known in the art, including detergent
solubilization, selective precipitation with such substances as ammonium sulfate, column
chromatography, immunopurification methods, and others. See, for instance, R. Scopes,
Protein Purification: Principles and Practice, Springer-Verlag: New York (1982);
Deutscher, Guide to Protein Purification, Academic Press (1990). For example,
antibodies may be raised to the proteins as described herein. Purification from E. coli can
be achieved following procedures described in U.S. Patent No. 4,511,503. The protein
may then be isolated from cells expressing the protein and further purified by standard
protein chemistry techniques as described herein. Detection of the expressed protein is
achieved by methods known in the art and include, for example, radioimmunoassays,
Western blotting techniques or immunoprecipitation.

Introduction of Nucleic Acids Into Host Cells 

The method of introducing a nucleic acid of the present invention into a host cell is
not critical to the instant invention. Transformation or transfection methods are
conveniently used. Accordingly, a wide variety of methods have been developed to insert
a DNA sequence into the genome of a host cell to obtain the transcription and/or
translation of the sequence to effect phenotypic changes in the organism. Thus, any
method which provides for effective introduction of a nucleic acid may be employed.

A. Plant Transformation

A nucleic acid comprising a polynucleotide of the present invention is optionally
introduced into a plant. Generally, the polynucleotide will first be incorporated into a
recombinant expression cassette or vector. Isolated nucleic acids of the present
invention can be introduced into plants according to techniques known in the art.
Techniques for transforming a wide variety of higher plant species are well known and
described in the technical, scientific, and patent literature. See, for example, Weising et
al., Ann. Rev. Genet. 22: 421-477 (1988). For example, the DNA construct may be
introduced directly into the genomic DNA of the plant cell using techniques such as
electroporation, polyethylene glycol (PEG) poration, particle bombardment, silicon fiber
delivery, or microinjection of plant cell protoplasts or embryogenic callus. See, e.g.,
Tomes, et al., Direct DNA Transfer into Intact Plant Cells Via Microprojectile
Bombardment, pp. 197-213 in Plant Cell, Tissue and Organ Culture, Fundamental
Methods, eds. O. L. Gamborg and G. C. Phillips, Springer-Verlag Berlin Heidelberg New
York, 1995; see, U.S. Patent No. 5,990,387. The introduction of DNA constructs using
PEG precipitation is described in Paszkowski et al., Embo J. 3: 2717-2722 (1984).
Electroporation techniques are described in Fromm et al., Proc. Natl. Acad. Sci. (USA) 82:
5824 (1985). Ballistic transformation techniques are described in Klein et al., Nature 327:
70-73 (1987).

Agrobacterium tumefaciens-mediated transformation techniques are well described
in the scientific literature. See, for example, Horsch et al., Science 233: 496-498 (1984);
Fraley et al., Proc. Natl. Acad. Sci. (USA) 80: 4803 (1983); and Plant Molecular Biology:
A Laboratory Manual, Chapter 8, Clark, Ed., Springer-Verlag, Berlin (1997). The DNA
constructs may be combined with suitable T-DNA flanking regions and introduced into a
conventional Agrobacterium tumefaciens host vector. The virulence functions of the
Agrobacterium tumefaciens host will direct the insertion of the construct and adjacent
marker into the plant cell DNA when the cell is infected by the bacteria. See, U.S. Patent
No. 5,591,616. Although Agrobacterium is useful primarily in dicots, certain monocots
can be transformed by Agrobacterium. For instance, Agrobacterium transformation of
maize is described in U.S. Patent No. 5,550,318.

Other methods of transfection or transformation include (1) Agrobacterium
rhizogenes-mediated transformation (see, e.g., Lichtenstein and Fuller, In: Genetic
Engineering, vol. 6, PWJ Rigby, Ed., London, Academic Press, 1987; and Lichtenstein, C.
P., and Draper, J., In: DNA Cloning, Vol. II, D. M. Glover, Ed., Oxford, IRL Press, 1985;
Application PCT/US87/02512 (WO 88/02405 published Apr. 7, 1988) describes the use of
A. rhizogenes strain A4 and its Ri plasmid along with A. tumefaciens vectors pARC8 or
pARC16), (2) liposome-mediated DNA uptake (see, e.g., Freeman et al., Plant Cell Physiol.
25: 1353 (1984)), and (3) the vortexing method (see, e.g., Kindle, Proc. Natl. Acad. Sci. (USA)
87: 1228 (1990)).

DNA can also be introduced into plants by direct DNA transfer into pollen as
described by Zhou et al., Methods in Enzymology, 101:433 (1983); D. Hess, Intern. Rev.
Cytol., 107:367 (1987); Luo et al., Plant Mol. Biol. Reporter, 6:165
(1988). Expression of polypeptide coding genes can be obtained by injection of the DNA
into reproductive organs of a plant as described by Pena et al., Nature, 325:274 (1987).
DNA can also be injected directly into the cells of immature embryos and the rehydration
of desiccated embryos as described by Neuhaus et al., Theor. Appl. Genet., 75:30 (1987);
and Benbrook et al., in Proceedings Bio Expo 1986, Butterworth, Stoneham, Mass., pp.
27-54 (1986). A variety of plant viruses that can be employed as vectors are known in the
art and include cauliflower mosaic virus (CaMV), geminivirus, brome mosaic virus, and
tobacco mosaic virus.

B. Transfection of Prokaryotes, Lower Eukaryotes, and Animal Cells

Animal and lower eukaryotic (e.g., yeast) host cells are competent or rendered
competent for transfection by various means. There are several well-known methods of
introducing DNA into animal cells. These include: calcium phosphate precipitation, fusion
of the recipient cells with bacterial protoplasts containing the DNA, treatment of the
recipient cells with liposomes containing the DNA, DEAE dextran, electroporation,
biolistics, and micro-injection of the DNA directly into the cells. The transfected cells are
cultured by means well known in the art. Kuchler, R.J., Biochemical Methods in Cell
Culture and Virology, Dowden, Hutchinson and Ross, Inc. (1977).

Transgenic Plant Regeneration 

Plant cells which directly result or are derived from the nucleic acid introduction
techniques can be cultured to regenerate a whole plant which possesses the introduced
genotype. Such regeneration techniques often rely on manipulation of certain
phytohormones in a tissue culture growth medium. Plant cells can be regenerated, e.g.,
from single cells, callus tissue or leaf discs according to standard plant tissue culture
techniques. It is well known in the art that various cells, tissues, and organs from almost
any plant can be successfully cultured to regenerate an entire plant. Plant regeneration
from cultured protoplasts is described in Evans et al., Protoplasts Isolation and Culture,
Handbook of Plant Cell Culture, Macmillan Publishing Company, New York, pp. 124-176
(1983); and Binding, Regeneration of Plants, Plant Protoplasts, CRC Press, Boca Raton,
pp. 21-73 (1985).

The regeneration of plants from either single plant protoplasts or various explants
is well known in the art. See, for example, Methods for Plant Molecular Biology, A.
Weissbach and H. Weissbach, eds., Academic Press, Inc., San Diego, Calif. (1988). This
regeneration and growth process includes the steps of selection of transformant cells and
shoots, rooting the transformant shoots, and growth of the plantlets in soil. For maize cell
culture and regeneration see generally, The Maize Handbook, Freeling and Walbot, Eds.,
Springer, New York (1994); Corn and Corn Improvement, 3rd edition, Sprague and Dudley
Eds., American Society of Agronomy, Madison, Wisconsin (1988). For transformation and
regeneration of maize see, Gordon-Kamm et al., The Plant Cell, 2:603-618 (1990).

The regeneration of plants containing the polynucleotide of the present invention
and introduced by Agrobacterium from leaf explants can be achieved as described by
Horsch et al., Science, 227:1229-1231 (1985). In this procedure, transformants are grown
in the presence of a selection agent and in a medium that induces the regeneration of
shoots in the plant species being transformed as described by Fraley et al., Proc. Natl.
Acad. Sci. (U.S.A.), 80:4803 (1983). This procedure typically produces shoots within two
to four weeks and these transformant shoots are then transferred to an appropriate root-
inducing medium containing the selective agent and an antibiotic to prevent bacterial
growth. Transgenic plants of the present invention may be fertile or sterile.

One of skill will recognize that after the recombinant expression cassette is stably
incorporated in transgenic plants and confirmed to be operable, it can be introduced into
other plants by sexual crossing. Any of a number of standard breeding techniques can be
used, depending upon the species to be crossed. In vegetatively propagated crops, mature
transgenic plants can be propagated by the taking of cuttings or by tissue culture
techniques to produce multiple identical plants. Selection of desirable transgenics is made
and new varieties are obtained and propagated vegetatively for commercial use. In seed
propagated crops, mature transgenic plants can be self crossed to produce a homozygous
inbred plant. The inbred plant produces seed containing the newly introduced heterologous
nucleic acid. These seeds can be grown to produce plants that would produce the selected
phenotype. Parts obtained from the regenerated plant, such as flowers, seeds, leaves,
branches, fruit, and the like are included in the invention, provided that these parts
comprise cells comprising the isolated nucleic acid of the present invention. Progeny,
variants, and mutants of the regenerated plants are also included within the scope of the
invention, provided that these parts comprise the introduced nucleic acid sequences.

Transgenic plants expressing a polynucleotide of the present invention can be
screened for transmission of the nucleic acid of the present invention by, for example,
standard immunoblot and DNA detection techniques. Expression at the RNA level can be
determined initially to identify and quantitate expression-positive plants. Standard
techniques for RNA analysis can be employed and include PCR amplification assays using
oligonucleotide primers designed to amplify only the heterologous RNA templates and
solution hybridization assays using heterologous nucleic acid-specific probes. The RNA-
positive plants can then be analyzed for protein expression by Western immunoblot analysis
using the specifically reactive antibodies of the present invention. In addition, in situ
hybridization and immunocytochemistry according to standard protocols can be done
using heterologous nucleic acid specific polynucleotide probes and antibodies,
respectively, to localize sites of expression within transgenic tissue. Generally, a number
of transgenic lines are screened for the incorporated nucleic acid to identify and
select plants with the most appropriate expression profiles.

A preferred embodiment is a transgenic plant that is homozygous for the added
heterologous nucleic acid; i.e., a transgenic plant that contains two added nucleic acid
sequences, one gene at the same locus on each chromosome of a chromosome pair. A
homozygous transgenic plant can be obtained by sexually mating (selfing) a heterozygous
transgenic plant that contains a single added heterologous nucleic acid, germinating some
of the seed produced and analyzing the resulting plants produced for altered expression of
a polynucleotide of the present invention relative to a control plant (i.e., native, non-
transgenic). Back-crossing to a parental plant and out-crossing with a non-transgenic
plant are also contemplated.

Modulating Polypeptide Levels and/or Composition

The present invention further provides a method for modulating (i.e., increasing or
decreasing) the concentration or ratio of the polypeptides of the present invention in a
plant or part thereof. Modulation can be effected by increasing or decreasing the
concentration and/or the ratio of the polypeptides of the present invention in a plant. The
method comprises introducing into a plant cell a recombinant expression cassette
comprising a polynucleotide of the present invention as described above to obtain a
transgenic plant cell, culturing the transgenic plant cell under transgenic plant cell growing
conditions, and inducing or repressing expression of a polynucleotide of the present
invention in the transgenic plant for a time sufficient to modulate concentration and/or the
ratios of the polypeptides in the transgenic plant or plant part.

In some embodiments, the concentration and/or ratios of polypeptides of the
present invention in a plant may be modulated by altering, in vivo or in vitro, the promoter
of a gene to up- or down-regulate gene expression. In some embodiments, the coding
regions of native genes of the present invention can be altered via substitution, addition,
insertion, or deletion to decrease activity of the encoded enzyme. See, e.g., Kmiec, U.S.
Patent 5,565,350; Zarling et al., WO 93/22443; Briggs, et al., U.S. Patent 5,962,764. And
in some embodiments, an isolated nucleic acid (e.g., a vector) comprising a promoter
sequence is transfected into a plant cell. Subsequently, a plant cell comprising the
promoter operably linked to a polynucleotide of the present invention is selected for by
means known to those of skill in the art such as, but not limited to, Southern blot, DNA
sequencing, or PCR analysis using primers specific to the promoter and to the gene and
detecting amplicons produced therefrom. A plant or plant part altered or modified by the
foregoing embodiments is grown under plant forming conditions for a time sufficient to
modulate the concentration and/or ratios of polypeptides of the present invention in the
plant. Plant forming conditions are well known in the art and discussed briefly, supra.

In general, the concentration or the ratios of the polypeptides are increased or decreased
by at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% relative to a native
control plant, plant part, or cell lacking the aforementioned recombinant expression
cassette. Modulation in the present invention may occur during and/or subsequent to
growth of the plant to the desired stage of development. Modulating nucleic acid
expression temporally and/or in particular tissues can be controlled by employing the
appropriate promoter operably linked to a polynucleotide of the present invention in, for
example, sense or antisense orientation as discussed in greater detail, supra. Induction of
expression of a polynucleotide of the present invention can also be controlled by
exogenous administration of an effective amount of inducing compound. Inducible
promoters and inducing compounds which activate expression from these promoters are
well known in the art. In preferred embodiments, the polypeptides of the present invention
are modulated in monocots, particularly maize.

UTRs and Codon Preference 

In general, translational efficiency has been found to be regulated by specific
sequence elements in the 5' non-coding or untranslated region (5' UTR) of the RNA.
Positive sequence motifs include translational initiation consensus sequences (Kozak,
Nucleic Acids Res. 15:8125 (1987)) and the 7-methylguanosine cap structure (Drummond
et al., Nucleic Acids Res. 13:7375 (1985)). Negative elements include stable
intramolecular 5' UTR stem-loop structures (Muesing et al., Cell 48:691 (1987)) and AUG
sequences or short open reading frames preceded by an appropriate AUG in the 5' UTR
(Kozak, supra; Rao et al., Mol. and Cell. Biol. 8:284 (1988)). Accordingly, the present
invention provides 5' and/or 3' untranslated regions for modulation of translation of
heterologous coding sequences.

Further, the polypeptide-encoding segments of the polynucleotides of the present
invention can be modified to alter codon usage. Altered codon usage can be employed to
alter translational efficiency and/or to optimize the coding sequence for expression in a
desired host, such as to optimize the codon usage in a heterologous sequence for expression
in maize. Codon usage in the coding regions of the polynucleotides of the present
invention can be analyzed statistically using commercially available software packages
such as "Codon Preference" available from the University of Wisconsin Genetics
Computer Group (see Devereaux et al., Nucleic Acids Res. 12: 387-395 (1984)) or
MacVector 4.1 (Eastman Kodak Co., New Haven, Conn.). Thus, the present invention
provides a codon usage frequency characteristic of the coding region of at least one of the
polynucleotides of the present invention. The number of polynucleotides that can be used
to determine a codon usage frequency can be any integer from 1 to the number of
polynucleotides of the present invention as provided herein. Optionally, the
polynucleotides will be full-length sequences. An exemplary number of sequences for
statistical analysis can be at least 1, 5, 10, 20, 50, or 100.
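
By way of illustration only, a codon usage frequency of the kind described above could be
tabulated along the following lines. This is a minimal sketch and not the method of the
"Codon Preference" or MacVector packages; the function name codon_usage and the two short
input sequences are hypothetical, and each input is assumed to be an in-frame coding
sequence.

```python
from collections import Counter


def codon_usage(coding_sequences):
    """Return the relative frequency of each codon across in-frame coding sequences.

    Hypothetical helper for illustration; any number of sequences (1 or more)
    may be supplied, mirroring the text above.
    """
    counts = Counter()
    for seq in coding_sequences:
        seq = seq.upper().replace("U", "T")  # accept RNA or DNA alphabets
        usable = len(seq) - len(seq) % 3     # ignore a trailing partial codon
        for i in range(0, usable, 3):
            counts[seq[i:i + 3]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {codon: n / total for codon, n in counts.items()}


# Two made-up coding sequences (assumed examples, not sequences of the invention).
usage = codon_usage(["ATGGCCGCCTAA", "ATGGCGTAA"])
```

The resulting table could then be compared with a host-specific table (e.g., for maize) to
decide which synonymous codons to substitute.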

Sequence Shuffling

The present invention provides methods for sequence shuffling using
polynucleotides of the present invention, and compositions resulting therefrom. Sequence
shuffling is described in PCT publication No. WO 97/20078. See also, Zhang, J.-H., et al.,
Proc. Natl. Acad. Sci. USA 94:4504-4509 (1997). Generally, sequence shuffling provides
a means for generating libraries of polynucleotides having a desired characteristic which
can be selected or screened for. Libraries of recombinant polynucleotides are generated
from a population of related sequence polynucleotides which comprise sequence regions
which have substantial sequence identity and can be homologously recombined in vitro or
in vivo. The population of sequence-recombined polynucleotides comprises a
subpopulation of polynucleotides which possess desired or advantageous characteristics
and which can be selected by a suitable selection or screening method. The characteristics
can be any property or attribute capable of being selected for or detected in a screening
system, and may include properties of: an encoded protein, a transcriptional element, a
sequence controlling transcription, RNA processing, RNA stability, chromatin
conformation, translation, or other expression property of a gene or transgene, a replicative
element, a protein-binding element, or the like, such as any feature which confers a
selectable or detectable property. In some embodiments, the selected characteristic will be
a decreased Km and/or increased Kcat over the wild-type protein as provided herein. In
other embodiments, a protein or polynucleotide generated from sequence shuffling will
have a ligand binding affinity greater than the non-shuffled wild-type polynucleotide. The
increase in such properties can be at least 110%, 120%, 130%, 140% or at least 150% of
the wild-type value.

Generic and Consensus Sequences 

Polynucleotides and polypeptides of the present invention further include those
having: (a) a generic sequence of at least two homologous polynucleotides or polypeptides,
respectively, of the present invention; and, (b) a consensus sequence of at least three
homologous polynucleotides or polypeptides, respectively, of the present invention. The
generic sequence of the present invention comprises each species of polypeptide or
polynucleotide embraced by the generic polypeptide or polynucleotide sequence,
respectively. The individual species encompassed by a polynucleotide having an amino
acid or nucleic acid consensus sequence can be used to generate antibodies or produce
nucleic acid probes or primers to screen for homologs in other species, genera, families,
orders, classes, phyla, or kingdoms. For example, a polynucleotide having a consensus
sequence from a gene family of Zea mays can be used to generate antibody or nucleic acid
probes or primers to other Gramineae species such as wheat, rice, or sorghum.
Alternatively, a polynucleotide having a consensus sequence generated from orthologous
genes can be used to identify or isolate orthologs of other taxa. Typically, a
polynucleotide having a consensus sequence will be at least 9, 10, 15, 20, 25, 30, or 40
amino acids in length, or 20, 30, 40, 50, 100, or 150 nucleotides in length. As those of
skill in the art are aware, a conservative amino acid substitution can be used for amino
acids which differ amongst aligned sequences but are from the same conservative
substitution group as discussed above. Optionally, no more than 1 or 2 conservative amino
acids are substituted for each 10 amino acid length of consensus sequence.
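
As a rough illustration of how a consensus sequence might be derived from three or more
aligned homologous sequences, consider the sketch below. It uses a simple per-column
majority rule, which is an assumption made here for illustration and is not the algorithm
of any software named in this description; the function name consensus, the min_fraction
parameter, and the placeholder symbol "X" are all hypothetical.

```python
from collections import Counter


def consensus(aligned, min_fraction=0.5, ambiguous="X"):
    """Majority-rule consensus of pre-aligned, equal-length sequences.

    A column contributes its most common residue when that residue occurs in
    more than `min_fraction` of the sequences; otherwise `ambiguous` is emitted.
    """
    if len(aligned) < 3:
        raise ValueError("a consensus here requires at least three sequences")
    if len({len(s) for s in aligned}) != 1:
        raise ValueError("sequences must already be aligned to equal length")
    result = []
    for column in zip(*aligned):
        residue, count = Counter(column).most_common(1)[0]
        result.append(residue if count / len(column) > min_fraction else ambiguous)
    return "".join(result)


# Made-up aligned sequences, for illustration only.
protein_consensus = consensus(["MKVLA", "MKILA", "MRVLA"])
dna_consensus = consensus(["ACGT", "ACGA", "TCGC"])
```

A stricter rule, or one that groups residues by conservative substitution group before
counting, could be substituted for the majority rule without changing the overall shape of
the procedure.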

Similar sequences used for generation of a consensus or generic sequence include
any number and combination of allelic variants of the same gene, orthologous, or
paralogous sequences as provided herein. Optionally, similar sequences used in generating
a consensus or generic sequence are identified using the BLAST algorithm's smallest sum
probability (P(N)). Various suppliers of sequence-analysis software are listed in chapter 7
of Current Protocols in Molecular Biology, F. M. Ausubel et al., Eds., Current Protocols, a
joint venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc.
(Supplement 30). A polynucleotide sequence is considered similar to a reference sequence
if the smallest sum probability in a comparison of the test nucleic acid to the reference
nucleic acid is less than about 0.1, more preferably less than about 0.01, or 0.001, and most
preferably less than about 0.0001, or 0.00001. Similar polynucleotides can be aligned and
a consensus or generic sequence generated using multiple sequence alignment software
available from a number of commercial suppliers such as the Genetics Computer Group's
(Madison, WI) PILEUP software, Vector NTI's (North Bethesda, MD) ALIGNX, or
Genecode's (Ann Arbor, MI) SEQUENCHER. Conveniently, default parameters of such
software can be used to generate consensus or generic sequences.
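
The graded thresholds on the smallest sum probability can be expressed compactly. The
sketch below assumes that a P(N) value has already been obtained from a BLAST comparison
performed elsewhere; it does not run BLAST, and the names THRESHOLDS and similarity_tier
are hypothetical.

```python
# Thresholds recited above, from least to most stringent.
THRESHOLDS = (0.1, 0.01, 0.001, 0.0001, 0.00001)


def similarity_tier(p_n):
    """Return how many of the recited thresholds the P(N) value falls below.

    0 means the test sequence is not considered similar to the reference;
    5 means it satisfies even the most stringent threshold.
    """
    return sum(1 for threshold in THRESHOLDS if p_n < threshold)


tiers = [similarity_tier(p) for p in (0.5, 0.05, 1e-6)]
```

Sequences whose tier is at least 1 would then be candidates for alignment and consensus
generation.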

Machine Applications 

The present invention provides machines, data structures, and processes for
modeling or analyzing the polynucleotides and polypeptides of the present invention.

A. Machines: Data, Data Structures, Processes, and Functions

The present invention provides a machine having a memory comprising: 1) data
representing a sequence of a polynucleotide or polypeptide of the present invention, 2) a
data structure which reflects the underlying organization and structure of the data and
facilitates program access to data elements corresponding to logical sub-components of the
sequence, 3) processes for effecting the use, analysis, or modeling of the sequence, and 4)
optionally, a function or utility for the polynucleotide or polypeptide. Thus, the present
invention provides a memory for storing data that can be accessed by a computer
programmed to implement a process for effecting the use, analysis, or modeling of a
sequence of a polynucleotide, with the memory comprising data representing the sequence
of a polynucleotide of the present invention.

The machine of the present invention is typically a digital computer. The term
"computer" includes one or several desktop or portable computers, computer
workstations, servers (including intranet or internet servers), mainframes, and any
integrated system comprising any of the above irrespective of whether the processing,
memory, input, or output of the computer is remote or local, as well as any networking
interconnecting the modules of the computer. The term "computer" is exclusive of
computers of the United States Patent and Trademark Office or the European Patent Office
when data representing the sequence of polypeptides or polynucleotides of the present
invention is used for patentability searches.

The present invention contemplates providing as data a sequence of a
polynucleotide of the present invention embodied in a computer readable medium. As
those of skill in the art will be aware, the form of memory of a machine of the present
invention, or the particular embodiment of the computer readable medium, are not critical
elements of the invention and can take a variety of forms. The memory of such a machine
includes, but is not limited to, ROM, or RAM, or computer readable media such as, but not
limited to, magnetic media such as computer disks or hard drives, or media such as CD-
ROMs, DVDs, and the like.

The present invention further contemplates providing a data structure that is also
contained in memory. The data structure may be defined by the computer programs that
define the processes (see below) or it may be defined by the programming of separate data
storage and retrieval programs, subroutines, or systems. Thus, the present invention
provides a memory for storing a data structure that can be accessed by a computer
programmed to implement a process for effecting the use, analysis, or modeling of a
sequence of a polynucleotide. The memory comprises data representing a polynucleotide
having the sequence of a polynucleotide of the present invention. The data is stored within
memory. Further, a data structure, stored within memory, is associated with the data
reflecting the underlying organization and structure of the data to facilitate program access
to data elements corresponding to logical sub-components of the sequence. The data
structure enables the polynucleotide to be identified and manipulated by such programs.

In a further embodiment, the present invention provides a data structure that
contains data representing a sequence of a polynucleotide of the present invention stored
within a computer readable medium. The data structure is organized to reflect the logical
structuring of the sequence, so that the sequence is easily analyzed by software programs
capable of accessing the data structure. In particular, the data structures of the present
invention organize the reference sequences of the present invention in a manner which
allows software tools to perform a wide variety of analyses using logical elements and
sub-elements of each sequence.

An example of such a data structure resembles a layered hash table, where in one
dimension the base content of the sequence is represented by a string of elements A, T, C,
G and N. The direction from the 5' end to the 3' end is reflected by the order from
position 0 to the position of the length of the string minus one. Such a string,
corresponding to a nucleotide sequence of interest, has a certain number of substrings,
each of which is delimited by the string position of its 5' end and the string position of its
3' end within the parent string. In a second dimension, each substring is associated with, or
points to, one or multiple attribute fields. Such attribute fields contain annotations to the
region on the nucleotide sequence represented by the substring.

For example, a sequence under investigation is 520 bases long and represented by a
string named SeqTarget. There is a minor groove in the 5' upstream non-coding region
from position 12 to 38, which is identified as a binding site for an enhancer protein HM-A,
which in turn will increase the transcription of the gene represented by SeqTarget. Here,
the substring is represented as (12, 38) and has the following attributes: [upstream
uncoded], [minor groove], [HM-A binding] and [increase transcription upon binding by
HM-A]. Similarly, other types of information can be stored and structured in this manner,
such as information related to the whole sequence, e.g., whether the sequence is a full
length viral gene, a mammalian house keeping gene or an EST from clone X, information
related to the 3' downstream non-coding region, e.g., hairpin structure, and information
related to various domains of the coding region, e.g., zinc finger.
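
The SeqTarget example can be rendered, purely as a sketch, in the following form. The
class name AnnotatedSequence and its methods are hypothetical; a dictionary keyed by
(5' position, 3' position) stands in for the layered hash table, both positions are assumed
to be inclusive, and a run of 520 "N" characters stands in for the actual bases, which the
example does not specify.

```python
from dataclasses import dataclass, field


@dataclass
class AnnotatedSequence:
    name: str
    bases: str  # first dimension: string of A, T, C, G, N read 5' to 3'
    # second dimension: (start, end) positions -> list of attribute annotations
    regions: dict = field(default_factory=dict)

    def annotate(self, start, end, *attributes):
        """Attach attribute fields to the substring delimited by start..end (inclusive)."""
        if not (0 <= start <= end < len(self.bases)):
            raise ValueError("region lies outside the parent string")
        self.regions.setdefault((start, end), []).extend(attributes)

    def substring(self, start, end):
        """Return the bases of the region start..end (inclusive)."""
        return self.bases[start:end + 1]

    def attributes_at(self, position):
        """Collect the attributes of every annotated region covering a position."""
        return [a for (s, e), attrs in self.regions.items()
                if s <= position <= e for a in attrs]


# Placeholder bases: the 520-base sequence itself is not given in the example.
seq_target = AnnotatedSequence("SeqTarget", "N" * 520)
seq_target.annotate(12, 38, "upstream uncoded", "minor groove", "HM-A binding",
                    "increase transcription upon binding by HM-A")
```

Whole-sequence information could be attached in the same way by annotating the region
(0, 519), and further regions can be added at any time without altering existing entries.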

This data structure is an open structure and is robust enough to accommodate
newly generated data and acquired knowledge. Such a structure is also a flexible structure.
It can be trimmed down to a 1-D string to facilitate data mining and analysis steps, such as
clustering, repeat-masking, and HMM analysis. Meanwhile, such a data structure also can
extend the associated attributes into multiple dimensions. Pointers can be established
among the dimensioned attributes when needed to facilitate data management and
processing in a comprehensive genomics knowledgebase. Furthermore, such a data
structure is object-oriented. Polymorphism can be represented by a family or class of
sequence objects, each of which has an internal structure as discussed above. The
common traits are abstracted and assigned to the parent object, whereas each child object
represents a specific variant of the family or class. Such a data structure allows data to be
efficiently retrieved, updated and integrated by the software applications associated with
the sequence database and/or knowledgebase.
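
One possible reading of the parent and child arrangement is sketched below. It is an
assumption-laden illustration: the class SequenceObject, its fields, and the example
attribute strings are hypothetical, and a parent reference is used here in place of
language-level class inheritance.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class SequenceObject:
    name: str
    bases: str = ""
    attributes: list = field(default_factory=list)
    parent: Optional["SequenceObject"] = None  # family object holding common traits

    def all_attributes(self):
        """Common traits from the parent chain first, then this variant's own."""
        inherited = self.parent.all_attributes() if self.parent else []
        return inherited + self.attributes

    def as_string(self):
        """Trim the object down to its 1-D base string for downstream analysis."""
        return self.bases


# Hypothetical family with one variant; names and bases are made up.
family = SequenceObject("kinase family", attributes=["kinase domain"])
variant = SequenceObject("variant 2", bases="ATGC",
                         attributes=["allele 2"], parent=family)
```

Updating an attribute on the family object is then reflected in every variant that points
to it, which is one way the common traits can be maintained in a single place.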

The present invention contemplates providing processes for effecting analysis and 
modeling, which are described in the following section. 

Optionally, the present invention further contemplates that the machine of the
present invention will embody in some manner a utility or function for the polynucleotide
or polypeptide of the present invention. The function or utility of the polynucleotide or
polypeptide can be a function or utility for the sequence data, per se, or of the tangible
material. Exemplary functions or utilities include the name (per International Union of
Biochemistry and Molecular Biology rules of nomenclature) or function of the enzyme or
protein represented by the polynucleotide or polypeptide of the present invention; the
metabolic pathway of the protein represented by the polynucleotide or polypeptide of the
present invention; the substrate or product or structural role of the protein represented by
the polynucleotide or polypeptide of the present invention; or, the phenotype (e.g., an
agronomic or pharmacological trait) affected by modulating expression or activity of the
protein represented by the polynucleotide or polypeptide of the present invention.

B. Computer Analysis and Modeling

The present invention provides a process of modeling and analyzing data
representative of a polynucleotide or polypeptide sequence of the present invention. The
process comprises entering sequence data of a polynucleotide or polypeptide of the present
invention into a machine having a hardware or software sequence modeling and analysis
system, developing data structures to facilitate access to the sequence data, manipulating
the data to model or analyze the structure or activity of the polynucleotide or polypeptide,
and displaying the results of the modeling or analysis. Thus, the present invention
provides a process for effecting the use, analysis, or modeling of a polynucleotide
sequence or its derived peptide sequence through use of a computer having a memory.
The process comprises 1) placing into the memory data representing a polynucleotide
having the sequence of a polynucleotide of the present invention, and
developing within the memory a data structure associated with the data and reflecting the
underlying organization and structure of the data to facilitate program access to data
elements corresponding to logical sub-components of the sequence, 2) programming the
computer with a program containing instructions sufficient to implement the process for
effecting the use, analysis, or modeling of the polynucleotide sequence or the peptide
sequence, and 3) executing the program on the computer while granting the program
access to the data and to the data structure within the memory.
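
By way of illustration only, the three steps above (placing sequence data in memory, developing a data structure that exposes logical sub-components, and running a program against it) might be sketched in Python as follows. The class names, field names, and the demonstration sequence are hypothetical and are not taken from the specification.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Feature:
    """A logical sub-component of a sequence (hypothetical schema)."""
    name: str
    start: int  # 0-based, inclusive
    end: int    # 0-based, exclusive


@dataclass
class SequenceRecord:
    """Sequence data plus a structure giving program access to sub-components."""
    identifier: str
    residues: str
    features: list = field(default_factory=list)

    def add_feature(self, name, start, end):
        # Reject coordinates that do not fall inside the stored sequence.
        if not 0 <= start < end <= len(self.residues):
            raise ValueError("feature lies outside the sequence")
        self.features.append(Feature(name, start, end))

    def sub_sequence(self, name):
        # Return the residues covered by the first feature with this name.
        for feature in self.features:
            if feature.name == name:
                return self.residues[feature.start:feature.end]
        raise KeyError(name)


# Step 1: place the data in memory and build the data structure.
record = SequenceRecord("demo_polynucleotide", "GGCATGGCCTAAGG")
record.add_feature("coding_region", 3, 12)

# Steps 2 and 3: a program accesses a sub-component through the structure.
coding = record.sub_sequence("coding_region")
```

Keeping the coordinates separate from the residue string is one possible design; it lets several overlapping sub-components refer to the same stored sequence.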

A variety of modeling and analytic tools are well known in the art and available
commercially. Included amongst the modeling/analysis tools are methods to: 1) recognize
overlapping sequences (e.g., from a sequencing project) with a polynucleotide of the
present invention and create an alignment called a "contig"; 2) identify restriction enzyme
sites of a polynucleotide of the present invention; 3) identify the products of a T1
ribonuclease digestion of a polynucleotide of the present invention; 4) identify PCR
primers with minimal self-complementarity; 5) compute pairwise distances between
sequences in an alignment, reconstruct phylogenetic trees using distance methods, and
calculate the degree of divergence of two protein coding regions; 6) identify patterns such
as coding regions, terminators, repeats, and other consensus patterns in polynucleotides of
the present invention; 7) identify RNA secondary structure; 8) identify sequence motifs,
isoelectric point, secondary structure, hydrophobicity, and antigenicity in polypeptides of
the present invention; 9) translate polynucleotides of the present invention and
backtranslate polypeptides of the present invention; and 10) compare two protein or
nucleic acid sequences and identify points of similarity or dissimilarity between them.
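
As a minimal sketch of item 2) in the list above, restriction enzyme sites can be located by scanning a sequence for each recognition motif. The example uses the recognition sequences of Not I (GCGGCCGC) and Sal I (GTCGAC), the two enzymes used for cloning in the Examples; the function name and the demonstration sequence are hypothetical. Both motifs are palindromic, so scanning one strand is sufficient for these two enzymes.

```python
# Recognition sequences of the two cloning enzymes named in the Examples.
SITES = {"NotI": "GCGGCCGC", "SalI": "GTCGAC"}


def find_sites(sequence, sites=SITES):
    """Return {enzyme: [0-based start positions]} for each recognition motif."""
    seq = sequence.upper()
    hits = {}
    for enzyme, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start)
            # Resume one base later so overlapping sites are also reported.
            start = seq.find(motif, start + 1)
        hits[enzyme] = positions
    return hits


# Hypothetical demonstration sequence with two Sal I sites and one Not I site.
demo = "AAGTCGACTTGCGGCCGCAAGTCGAC"
site_map = find_sites(demo)
```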

The processes for effecting analysis and modeling can be produced independently
or obtained from commercial suppliers. Exemplary analysis and modeling tools are
provided in products such as InforMax's (Bethesda, MD) Vector NTI Suite (Version 5.5),
Intelligenetics' (Mountain View, CA) PC/Gene program, and Genetics Computer Group's
(Madison, WI) Wisconsin Package (Version 10.0); these tools, and the functions they
perform (as provided and disclosed by the programs and accompanying literature), are
incorporated herein by reference and are described in more detail in section C which
follows.

Thus, in a further embodiment, the present invention provides a machine-readable
medium containing a computer program and data, comprising a program stored on the
medium containing instructions sufficient to implement a process for effecting the use,
analysis, or modeling of a representation of a polynucleotide or peptide sequence. The
data stored on the medium represents a sequence of a polynucleotide having the sequence
of a polynucleotide of the present invention. The medium also includes a data structure
reflecting the underlying organization and structure of the data to facilitate program access
to data elements corresponding to logical sub-components of the sequence, the data
structure being inherent in the program and in the way in which the program organizes and
accesses the data.

C. Homology Searches

As an example of such a comparative analysis, the present invention provides a
process of identifying a candidate homologue (i.e., an ortholog or paralog) of a
polynucleotide or polypeptide of the present invention. The process comprises entering
sequence data of a polynucleotide or polypeptide of the present invention into a machine
having a hardware or software sequence analysis system, developing data structures to
facilitate access to the sequence data, manipulating the data to analyze the structure of the
polynucleotide or polypeptide, and displaying the results of the analysis. A candidate
homologue has a statistically significant probability of having the same biological function
(e.g., catalyzes the same reaction, binds to homologous proteins/nucleic acids, has a
similar structural role) as the reference sequence to which it is compared. Accordingly, the
polynucleotides and polypeptides of the present invention have utility in identifying
homologs in animals or other plant species, particularly those in the family Gramineae
such as, but not limited to, sorghum, wheat, or rice.

The process of the present invention comprises obtaining data representing a
polynucleotide or polypeptide test sequence. Test sequences can be obtained from a
nucleic acid of an animal or plant. Test sequences can be obtained directly or indirectly
from sequence databases including, but not limited to, those such as: GenBank, EMBL,
GenSeq, SWISS-PROT, or those available on-line via the UK Human Genome Mapping
Project (HGMP) GenomeWeb. In some embodiments the test sequence is obtained from a
plant species other than maize whose function is uncertain but will be compared to the
reference sequence to determine sequence similarity or sequence identity. The test sequence
data is entered into a machine, such as a computer, containing: i) data representing a
reference sequence and, ii) a hardware or software sequence comparison system to compare
the reference and test sequence for sequence similarity or identity.

Exemplary sequence comparison systems are provided for in sequence analysis
software such as those provided by the Genetics Computer Group (Madison, WI),
InforMax (Bethesda, MD), or Intelligenetics (Mountain View, CA). Optionally, sequence
comparison is established using the BLAST or GAP suite of programs. Generally, a
smallest sum probability value (P(N)) of less than 0.1, or alternatively, less than 0.01,
0.001, 0.0001, or 0.00001 using the BLAST 2.0 suite of algorithms under default
parameters identifies the test sequence as a candidate homologue (i.e., an allele, ortholog,
or paralog) of the reference sequence. Those of skill in the art will recognize that a
candidate homologue has an increased statistical probability of having the same or similar
function as the gene/protein represented by the reference sequence.
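
The thresholding step described above can be sketched as a simple filter over search results. This is an illustration under stated assumptions: the (name, P(N)) pairs are invented, the function name is hypothetical, and no particular BLAST output format is implied.

```python
def candidate_homologues(hits, threshold=0.1):
    """Return names of test sequences whose smallest sum probability P(N)
    is below the chosen threshold (0.1 by default, per the text above)."""
    return [name for name, p_value in hits if p_value < threshold]


# Invented (test sequence, P(N)) pairs standing in for parsed search results.
hits = [("test_seq_1", 3e-12), ("test_seq_2", 0.04), ("test_seq_3", 0.5)]

loose = candidate_homologues(hits)                      # P(N) < 0.1
strict = candidate_homologues(hits, threshold=0.00001)  # P(N) < 0.00001
```

Tightening the threshold trades sensitivity for fewer chance matches, which is why the text lists a series of progressively smaller cutoffs.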

The reference sequence can be the sequence of a polypeptide or a polynucleotide of
the present invention. The reference or test sequence is each optionally at least 25 amino
acids or at least 100 nucleotides in length. The length of the reference or test sequences
can be the length of the polynucleotide or polypeptide described, respectively, above in the
sections entitled "Nucleic Acids" (particularly section (g)), and "Proteins". As those of
skill in the art are aware, the greater the sequence identity/similarity between a reference
sequence of known function and a test sequence, the greater the probability that the test
sequence will have the same or similar function as the reference sequence. The results of
the comparison between the test and reference sequences are outputted (e.g., displayed,
printed, recorded) via any one of a number of output devices and/or media (e.g., computer
monitor, hard copy, or computer readable medium).

Detection of Nucleic Acids

The present invention further provides methods for detecting a polynucleotide of
the present invention in a nucleic acid sample suspected of containing a polynucleotide of
the present invention, such as a plant cell lysate, particularly a lysate of maize. In some
embodiments, a cognate gene of a polynucleotide of the present invention or portion
thereof can be amplified prior to the step of contacting the nucleic acid sample with a
polynucleotide of the present invention. The nucleic acid sample is contacted with the
polynucleotide to form a hybridization complex. The polynucleotide hybridizes under
stringent conditions to a gene encoding a polypeptide of the present invention. Formation
of the hybridization complex is used to detect a gene encoding a polypeptide of the present
invention in the nucleic acid sample. Those of skill will appreciate that an isolated nucleic
acid comprising a polynucleotide of the present invention should lack cross-hybridizing
sequences in common with non-target genes that would yield a false positive result.
Detection of the hybridization complex can be achieved using any number of well known
methods. For example, the nucleic acid sample, or a portion thereof, may be assayed by
hybridization formats including but not limited to, solution phase, solid phase, mixed
phase, or in situ hybridization assays.

Detectable labels suitable for use in the present invention include any composition
detectable by spectroscopic, radioisotopic, photochemical, biochemical, immunochemical,
electrical, optical or chemical means. Useful labels in the present invention include biotin
for staining with labeled streptavidin conjugate, magnetic beads, fluorescent dyes,
radiolabels, enzymes, and colorimetric labels. Other labels include ligands which bind to
antibodies labeled with fluorophores, chemiluminescent agents, and enzymes. Labeling
the nucleic acids of the present invention is readily achieved such as by the use of labeled
PCR primers.

Although the present invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it will be obvious that 
certain changes and modifications may be practiced within the scope of the appended 
claims. 


Example 1 

This example describes the construction of a cDNA library. 

Total RNA can be isolated from maize tissues with TRIzol Reagent (Life
Technology Inc. Gaithersburg, MD) using a modification of the guanidine
isothiocyanate/acid-phenol procedure described by Chomczynski and Sacchi
(Chomczynski, P., and Sacchi, N. Anal. Biochem. 162, 156 (1987)). In brief, plant tissue
samples are pulverized in liquid nitrogen before the addition of the TRIzol Reagent, and
then further homogenized with a mortar and pestle. Addition of chloroform followed by
centrifugation is conducted for separation of an aqueous phase and an organic phase. The
total RNA is recovered by precipitation with isopropyl alcohol from the aqueous phase.

The selection of poly(A)+ RNA from total RNA can be performed using the
PolyATtract system (Promega Corporation, Madison, WI). Biotinylated oligo(dT) primers
are used to hybridize to the 3' poly(A) tails on mRNA. The hybrids are captured using
streptavidin coupled to paramagnetic particles and a magnetic separation stand. The
mRNA is then washed at high stringency conditions and eluted by RNase-free deionized
water.

cDNA synthesis and construction of unidirectional cDNA libraries can be
accomplished using the Superscript Plasmid System (Life Technology Inc. Gaithersburg,
MD). The first strand of cDNA is synthesized by priming with an oligo(dT) primer containing
a Not I site. The reaction is catalyzed by Superscript Reverse Transcriptase II at 45°C.
The second strand of cDNA is labeled with alpha-32P-dCTP and a portion of the reaction is
analyzed by agarose gel electrophoresis to determine cDNA sizes. cDNA molecules
smaller than 500 base pairs and unligated adapters are removed by Sephacryl-S400
chromatography. The selected cDNA molecules are ligated into the pSPORT1 vector
between the Not I and Sal I sites.

Alternatively, cDNA libraries can be prepared by any one of many methods
available. For example, the cDNAs may be introduced into plasmid vectors by first
preparing the cDNA libraries in Uni-ZAP XR vectors according to the manufacturer's
protocol (Stratagene Cloning Systems, La Jolla, CA). The Uni-ZAP XR libraries are
converted into plasmid libraries according to the protocol provided by Stratagene. Upon
conversion, cDNA inserts will be contained in the plasmid vector pBluescript. In addition,
the cDNAs may be introduced directly into precut Bluescript II SK(+) vectors (Stratagene)
using T4 DNA ligase (New England Biolabs), followed by transfection into DH10B cells
according to the manufacturer's protocol (GIBCO BRL Products). Once the cDNA inserts
are in plasmid vectors, plasmid DNAs are prepared from randomly picked bacterial
colonies containing recombinant pBluescript plasmids, or the insert cDNA sequences are
amplified via polymerase chain reaction using primers specific for vector sequences
flanking the inserted cDNA sequences. Amplified insert DNAs or plasmid DNAs are
sequenced in dye-primer sequencing reactions to generate partial cDNA sequences
(expressed sequence tags or "ESTs"; see Adams et al. (1991) Science 252:1651-1656).
The resulting ESTs are analyzed using a Perkin Elmer Model 377 fluorescent sequencer.

Example 2 

This method describes construction of a full-length enriched cDNA library.

An enriched full-length cDNA library can be constructed using one of two
variations of the method of Carninci et al. Genomics 37: 327-336, 1996. These variations
are based on chemical introduction of a biotin group into the diol residue of the 5' cap
structure of eukaryotic mRNA to select full-length first strand cDNA. The selection
occurs by trapping the biotin residue at the cap sites using streptavidin-coated magnetic
beads followed by RNase I treatment to eliminate incompletely synthesized cDNAs.
Second strand cDNA is synthesized using established procedures such as those provided
in Life Technologies' (Rockville, MD) "Superscript Plasmid System for cDNA Synthesis
and Plasmid Cloning" kit. Libraries made by this method have been shown to contain
50% to 70% full-length cDNAs.

The first strand synthesis methods are detailed below. An asterisk denotes that the
reagent was obtained from Life Technologies, Inc.



A. First strand cDNA synthesis method 1 (with trehalose)

mRNA (10 µg)                      25 µl
*Not I primer (5 µg)              10 µl
*5x 1st strand buffer             43 µl
*0.1 M DTT                        10 µl
*dNTP mix 10 mM                   10 µl
BSA 10 µg/µl                      1 µl
Trehalose (saturated)             59.2 µl
RNase inhibitor (Promega)         1.8 µl
*Superscript II RT 200 U/µl       20 µl
100% glycerol                     18 µl
Water                             7 µl



The mRNA and Not I primer are mixed and denatured at 65°C for 10 min. They
are then chilled on ice and the other components are added to the tube. Incubation is at
45°C for 2 min. Twenty microliters of RT (reverse transcriptase) is added to the reaction
and the following program is started on the thermocycler (MJ Research, Waltham, MA):

Step 1     45°C    10 min
Step 2     45°C    -0.3°C/cycle, 2 seconds/cycle
Step 3     go to 2 for 33 cycles
Step 4     35°C    5 min
Step 5     45°C    5 min
Step 6     45°C    0.2°C/cycle, 1 sec/cycle
Step 7     go to 6 for 49 cycles
Step 8     55°C    0.1°C/cycle, 12 sec/cycle
Step 9     go to 8 for 49 cycles
Step 10    55°C    2 min
Step 11    60°C    2 min
Step 12    go to 11 for 9 times
Step 13    4°C     forever
Step 14    end

B. First strand cDNA synthesis method 2

mRNA (10 µg)                      25 µl
water                             30 µl
*Not I adapter primer (5 µg)      10 µl
65°C for 10 min, chill on ice, then add the following reagents:
*5x first buffer                  20 µl
*0.1 M DTT                        10 µl
*10 mM dNTP mix                   5 µl



Incubate at 45°C for 2 min, then add 10 µl of *Superscript II RT (200 U/µl) and start the
following program:

Step 1     45°C for 6 sec, -0.1°C/cycle
Step 2     go to 1 for 99 additional cycles
Step 3     35°C for 5 min
Step 4     45°C for 60 min
Step 5     50°C for 10 min
Step 6     4°C forever
Step 7     end

After the 1st strand cDNA synthesis, the DNA is extracted by phenol according to
standard procedures, then precipitated in NaOAc and ethanol and stored at -20°C.
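
The temperature ramps in the two thermocycler programs above can be checked with a short calculation. The sketch below is an illustration only: it assumes that each "go to" repeat applies the per-cycle increment once, and it assumes per-cycle increments of -0.3, +0.2, and +0.1 °C for method 1 and -0.1 °C for method 2. The function name is hypothetical.

```python
def ramp_end_temperature(start_c, step_c, repeats):
    """Temperature of the last pass through a ramp step that runs once at
    start_c and is then repeated `repeats` times, changing by step_c each time."""
    return start_c + step_c * repeats


# Method 1 (with trehalose): cool toward 35 C, then heat toward 55 C and 60 C.
m1_cool = ramp_end_temperature(45.0, -0.3, 33)
m1_heat_a = ramp_end_temperature(45.0, 0.2, 49)
m1_heat_b = ramp_end_temperature(55.0, 0.1, 49)

# Method 2: a single slow ramp from 45 C toward 35 C.
m2_cool = ramp_end_temperature(45.0, -0.1, 99)
```

Under these assumptions each ramp ends within a few tenths of a degree of the hold temperature of the step that follows it (35°C, 55°C, 60°C, and 35°C respectively).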



C. Oxidization of the diol group of mRNA for biotin labeling

First strand cDNA is spun down and washed once with 70% EtOH. The pellet is
resuspended in 23.2 µl of DEPC treated water and put on ice. Prepare 100 mM NaIO4
freshly, and then add the following reagents:

mRNA:1st cDNA (start with 20 µg mRNA)    46.4 µl
100 mM NaIO4 (freshly made)              2.5 µl
NaOAc 3 M pH 4.5                         1.1 µl

To make 100 mM NaIO4, use 21.39 µg of NaIO4 for 1 µl of water.
Wrap the tube in foil and incubate on ice for 45 min.
After the incubation, the reaction is then precipitated in:

5 M NaCl                                 10 µl
20% SDS                                  0.5 µl
isopropanol                              61 µl

Incubate on ice for at least 30 min, then spin it down at max speed at 4°C for 30 min and
wash once with 70% ethanol and then 80% EtOH.
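
The figure of 21.39 µg of NaIO4 per µl of water for a 100 mM solution can be reproduced from the molar mass of sodium periodate. The sketch below assumes a molar mass of 213.89 g/mol and neglects the volume contributed by the dissolved solid; the function name is hypothetical.

```python
NAIO4_MOLAR_MASS = 213.89  # g/mol for sodium periodate (assumed value)


def micrograms_per_microliter(molar_mass_g_per_mol, millimolar):
    """Mass of solute per microliter of solvent for a target concentration.

    g/mol multiplied by mol/L gives g/L, which is numerically equal to ug/ul.
    """
    return molar_mass_g_per_mol * millimolar / 1000.0


naio4_ug_per_ul = micrograms_per_microliter(NAIO4_MOLAR_MASS, 100)
```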

D. Biotinylation of the mRNA diol group
Resuspend the DNA in 110 µl DEPC treated water, then add the following reagents:

20% SDS                                  5 µl
2 M NaOAc pH 6.1                         5 µl
10 mM biotin hydrazide (freshly made)    300 µl

Wrap in foil and incubate at room temperature overnight.

E. RNase I treatment
Precipitate DNA in:

5 M NaCl                                 10 µl
2 M NaOAc pH 6.1                         75 µl
biotinylated mRNA:cDNA                   420 µl
100% EtOH (2.5 vol)                      1262.5 µl

(Perform this precipitation in two tubes and split the 420 µl of DNA into 210 µl each, add
5 µl of 5 M NaCl, 37.5 µl of 2 M NaOAc pH 6.1, and 631.25 µl of 100% EtOH).
Store at -20°C for at least 30 min. Spin the DNA down at 4°C at maximal speed for 30
min and wash with 80% EtOH twice, then dissolve DNA in 70 µl RNase free water. Pool
two tubes and end up with 140 µl.
Add the following reagents:

RNase One 10 U/µl                        40 µl
1st cDNA:RNA                             140 µl
10X buffer                               20 µl

Incubate at 37°C for 15 min.
Add 5 µl of 40 µg/µl yeast tRNA to each sample for capturing.

F. Full length cDNA capturing 

Blocking the beads with yeast tRNA:

Beads                                    1 ml
Yeast tRNA 40 µg/µl                      5 µl

Incubate on ice for 30 min with mixing, wash 3 times with 1 ml of 2 M NaCl,
50 mM EDTA, pH 8.0.

Resuspend the beads in [?]00 µl of 2 M NaCl, 50 mM EDTA, pH 8.0, add RNase I
treated sample 200 µl, and incubate the reaction for 30 min at room temperature.
Capture the beads using the magnetic stand, save the supernatant, and start the following
washes:

2 washes with 2 M NaCl, 50 mM EDTA, pH 8.0, 1 ml each time,
1 wash with 0.4% SDS, 50 µg/ml tRNA,
1 wash with 10 mM Tris-Cl pH 7.5, 0.2 mM EDTA, 10 mM NaCl, 20% glycerol,
1 wash with 50 µg/ml tRNA,
1 wash with 1st cDNA buffer



G. Second strand cDNA synthesis 



Resuspend the beads in:

*5X first buffer                         8 µl
*0.1 M DTT                               4 µl
*10 mM dNTP mix                          8 µl
*5X 2nd buffer                           60 µl
*E. coli Ligase 10 U/µl                  2 µl
*E. coli DNA polymerase 10 U/µl          8 µl
*E. coli RNaseH 1 U/µl                   2 µl
P32 dCTP 10 µCi/µl                       2 µl
Or water up to 300 µl                    208 µl

Incubate at 16°C for 2 hr, mixing the reaction every 30 min.

Add 4 µl of T4 DNA polymerase and incubate for an additional 5 min at 16°C.

Elute 2nd cDNA from the beads.
Use a magnetic stand to separate the 2nd cDNA from the beads, then resuspend the beads in
200 µl of water, and then separate again, pool the samples (about 500 µl).
Add 200 µl of water to the beads, then 200 µl of phenol:chloroform, vortex, and spin to
separate the sample with phenol.

Pool the DNA together (about 700 µl) and use phenol to clean the DNA again. DNA is then
precipitated in 2 µg of glycogen and 0.5 vol of 7.5 M NH4OAc and 2 vol of 100% EtOH.
Precipitate overnight. Spin down the pellet and wash with 70% EtOH, air-dry the pellet.

Tube 1                                   Tube 2
DNA                  250 µl              DNA                  200 µl
7.5 M NH4OAc         125 µl              7.5 M NH4OAc         100 µl
100% EtOH            750 µl              100% EtOH            600 µl
glycogen 1 µg/µl     2 µl                glycogen 1 µg/µl     2 µl
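
The two-tube volumes in the table above follow from the stated rule of 0.5 volume of 7.5 M NH4OAc and 2 volumes of 100% EtOH, where the ethanol volume is taken relative to the DNA plus salt. A minimal sketch of that arithmetic follows; the function name is hypothetical, and the small glycogen volume is ignored, as it appears to be in the table.

```python
def precipitation_volumes(dna_ul, salt_vol=0.5, ethanol_vol=2.0):
    """Return (salt_ul, ethanol_ul) for an ethanol precipitation.

    Salt is added as a fraction of the DNA volume; ethanol is added as a
    multiple of the combined DNA-plus-salt volume.
    """
    salt_ul = dna_ul * salt_vol
    ethanol_ul = (dna_ul + salt_ul) * ethanol_vol
    return salt_ul, ethanol_ul


tube_1 = precipitation_volumes(250.0)
tube_2 = precipitation_volumes(200.0)

# The same bookkeeping reproduces the 2.5-volume ethanol figure of section E:
# (10 ul NaCl + 75 ul NaOAc + 420 ul sample) x 2.5.
section_e_ethanol = (10.0 + 75.0 + 420.0) * 2.5
```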

H. Sal I adapter ligation
Resuspend the pellet in 26 µl of water and use 1 µl for TAE gel.

Set up the reaction as follows:

2nd strand cDNA                          25 µl
*5X T4 DNA ligase buffer                 10 µl
*Sal I adapters                          10 µl
*T4 DNA ligase                           5 µl

Mix gently, incubate the reaction at 16°C overnight.

Add 2 µl of ligase on the second day and incubate at room temperature for 2 hrs (optional).

Add 50 µl water to the reaction and use 100 µl of phenol to clean the DNA. 90 µl of the
upper phase is transferred into a new tube and precipitated in:

Glycogen 1 µg/µl                         2 µl
Upper phase DNA                          90 µl
7.5 M NH4OAc                             50 µl
100% EtOH                                300 µl

Precipitate at -20°C overnight.

Spin down the pellet at 4°C and wash in 70% EtOH, dry the pellet.

I. Not I digestion

2nd cDNA                                 41 µl
*Reaction 3 buffer                       5 µl
*Not I 15 U/µl                           4 µl

Mix gently and incubate the reaction at 37°C for 2 hr.

Add 50 µl of water and 100 µl of phenol, vortex, and take 90 µl of the upper phase to a new
tube, then add 50 µl of NH4OAc and 300 µl of EtOH. Precipitate overnight at
-20°C.

Cloning, ligation, and transformation are performed per the Superscript cDNA synthesis
kit.

Example 3 

This example describes cDNA sequencing and library subtraction. 

Individual colonies can be picked and DNA prepared either by PCR with M13
forward primers and M13 reverse primers, or by plasmid isolation. cDNA clones can be
sequenced using M13 reverse primers.

cDNA libraries are plated out on 22 x 22 cm² agar plates at a density of about 3,000
colonies per plate. The plates are incubated in a 37°C incubator for 12-24 hours. Colonies
are picked into 384-well plates by a robot colony picker, Q-bot (GENETIX Limited).
These plates are incubated overnight at 37°C. Once sufficient colonies are picked, they are
pinned onto 22 x 22 cm² nylon membranes using Q-bot. Each membrane holds 9,216 or
36,864 colonies. These membranes are placed onto an agar plate with an appropriate
antibiotic. The plates are incubated at 37°C overnight.

After colonies are recovered on the second day, these filters are placed on filter
paper prewetted with denaturing solution for four minutes, then incubated on top of a
boiling water bath for an additional four minutes. The filters are then placed on filter
paper prewetted with neutralizing solution for four minutes. After excess solution is
removed by placing the filters on dry filter papers for one minute, the colony side of the
filters is placed into Proteinase K solution and incubated for 40-50 minutes. The
filters are placed on dry filter papers to dry overnight. DNA is then cross-linked to the nylon
membrane by UV light treatment.

Colony hybridization is conducted as described by Sambrook, J., Fritsch, E.F. and
Maniatis, T., (in Molecular Cloning: A Laboratory Manual, 2nd Edition). The following
probes can be used in colony hybridization:

1. First strand cDNA from the same tissue from which the library was made, to remove
the most redundant clones.

2. 48-192 most redundant cDNA clones from the same library based on previous
sequencing data.

3. 192 most redundant cDNA clones in the entire maize sequence database.

4. A Sal-A20 oligonucleotide: TCG ACC CAC GCG TCC GAA AAA AAA
AAA AAA AAA AAA, which removes clones containing a poly A tail but no cDNA.

5. cDNA clones derived from rRNA.

The image of the autoradiography is scanned into a computer and the signal intensity
and cold colony addresses of each colony are analyzed. Re-arraying of cold colonies from
384 well plates to 96 well plates is conducted using Q-bot.
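
One way to picture the analysis step just described is as a threshold on per-colony signal followed by assignment of the surviving "cold" colonies to new wells. The sketch below is purely illustrative: the address format, the intensity values, the threshold, and both function names are hypothetical and do not describe the Q-bot software.

```python
def cold_colonies(intensities, threshold):
    """Return the sorted addresses of colonies whose hybridization signal
    falls below the threshold (i.e. colonies not matching any probe)."""
    return sorted(addr for addr, signal in intensities.items() if signal < threshold)


def rearray(addresses, wells_per_plate=96):
    """Map each source address to a (destination plate, well number) pair,
    filling 96-well plates in order; both numbers are 1-based."""
    return {
        addr: (index // wells_per_plate + 1, index % wells_per_plate + 1)
        for index, addr in enumerate(addresses)
    }


# Invented normalized signal intensities keyed by a made-up 384-well address.
intensities = {"P1:A01": 0.05, "P1:A02": 0.90, "P1:A03": 0.10, "P1:A04": 0.75}

cold = cold_colonies(intensities, threshold=0.2)
destinations = rearray(cold)
```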

Example 4 

This example describes identification of the gene from a computer homology
search.

Gene identities can be determined by conducting BLAST (Basic Local Alignment
Search Tool; Altschul, S. F., et al., (1990) J. Mol. Biol. 215:403-410; see also
www.ncbi.nlm.nih.gov/BLAST/) searches under default parameters for similarity to
sequences contained in the BLAST "nr" database (comprising all non-redundant GenBank
CDS translations, sequences derived from the 3-dimensional structure Brookhaven Protein
Data Bank, the last major release of the SWISS-PROT protein sequence database, EMBL,
and DDBJ databases). The cDNA sequences are analyzed for similarity to all publicly
available DNA sequences contained in the "nr" database using the BLASTN algorithm.
The DNA sequences are translated in all reading frames and compared for similarity to all
publicly available protein sequences contained in the "nr" database using the BLASTX
algorithm (Gish, W. and States, D. J. Nature Genetics 3:266-272 (1993)) provided by the
NCBI. In some cases, the sequencing data from two or more clones containing
overlapping segments of DNA are used to construct contiguous DNA sequences.

Sequence alignments and percent identity calculations can be performed using the
Megalign program of the LASERGENE bioinformatics computing suite (DNASTAR Inc.,
Madison, WI). Multiple alignment of the sequences can be performed using the Clustal
method of alignment (Higgins and Sharp (1989) CABIOS. 5:151-153) with the default
parameters (GAP PENALTY=10, GAP LENGTH PENALTY=10). Default parameters
for pairwise alignments using the Clustal method are KTUPLE=1, GAP PENALTY=3,
WINDOW=5 and DIAGONALS SAVED=5.
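
The translation of a DNA sequence "in all reading frames" that precedes a BLASTX comparison in this example can be sketched as follows. This is an illustration only, not the NCBI implementation: it uses the standard genetic code, represents stop codons as "*" and any codon containing a non-ACGT character as "X", and all function names are hypothetical.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate consecutive complete codons starting at the first base."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def six_frames(dna):
    """Return translations for frames +1, +2, +3, then -1, -2, -3."""
    strands = (dna, reverse_complement(dna))
    return [translate(strand[offset:]) for strand in strands for offset in range(3)]


frames = six_frames("ATGGCCTAA")
```

Each of the six translated strings would then be compared against protein sequences, which is what allows a cDNA of unknown orientation and frame to be matched to a known protein.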

Example 5 

This example describes expression of transgenes in monocot cells.

A transgene comprising a cDNA encoding the instant polypeptides in sense
orientation with respect to the maize 27 kD zein promoter that is located 5' to the cDNA
fragment, and the 10 kD zein 3' end that is located 3' to the cDNA fragment, can be
constructed. The cDNA fragment of this gene may be generated by polymerase chain
reaction (PCR) of the cDNA clone using appropriate oligonucleotide primers. Cloning
sites (NcoI or SmaI) can be incorporated into the oligonucleotides to provide proper
orientation of the DNA fragment when inserted into the digested vector pML103 as
described below. Amplification is then performed in a standard PCR. The amplified DNA
is then digested with restriction enzymes NcoI and SmaI and fractionated on an agarose
gel. The appropriate band can be isolated from the gel and combined with a 4.9 kb
NcoI-SmaI fragment of the plasmid pML103. Plasmid pML103 has been deposited under
the terms of the Budapest Treaty at ATCC (American Type Culture Collection, 10801
University Blvd., Manassas, VA 20110-2209), and bears accession number ATCC 97366.
The DNA segment from pML103 contains a 1.05 kb SalI-NcoI promoter fragment of the
maize 27 kD zein gene and a 0.96 kb SmaI-SalI fragment from the 3' end of the maize
10 kD zein gene in the vector pGem9Zf(+) (Promega). Vector and insert DNA can be
ligated at 15°C overnight, essentially as described (Maniatis). The ligated DNA may then
be used to transform E. coli XL1-Blue (Epicurian Coli XL-1 Blue; Stratagene). Bacterial
transformants can be screened by restriction enzyme digestion of plasmid DNA and
limited nucleotide sequence analysis using the dideoxy chain termination method
(Sequenase DNA Sequencing Kit; U.S. Biochemical). The resulting plasmid construct
would comprise a transgene encoding, in the 5' to 3' direction, the maize 27 kD zein
promoter, a cDNA fragment encoding the instant polypeptides, and the 10 kD zein 3'
region.

The transgene described above can then be introduced into maize cells by the
following procedure. Immature maize embryos can be dissected from developing
caryopses derived from crosses of the inbred maize lines H99 and LH132. The embryos
are isolated 10 to 11 days after pollination when they are 1.0 to 1.5 mm long. The
embryos are then placed with the axis-side facing down and in contact with agarose-solidified
N6 medium (Chu et al. (1975) Sci. Sin. Peking 18:659-668). The embryos are
kept in the dark at 27°C. Friable embryogenic callus consisting of undifferentiated masses
of cells with somatic proembryoids and embryoids borne on suspensor structures
proliferates from the scutellum of these immature embryos. The embryogenic callus
isolated from the primary explant can be cultured on N6 medium and sub-cultured on this
medium every 2 to 3 weeks.

The plasmid, p35S/Ac (Hoechst Ag, Frankfurt, Germany) or equivalent may be used
in transformation experiments in order to provide for a selectable marker. This plasmid
contains the Pat gene (see European Patent Publication 0 242 236) which encodes
phosphinothricin acetyl transferase (PAT). The enzyme PAT confers resistance to
herbicidal glutamine synthetase inhibitors such as phosphinothricin. The pat gene in
p35S/Ac is under the control of the 35S promoter from Cauliflower Mosaic Virus (Odell
et al. (1985) Nature 313:810-812) and the 3' region of the nopaline synthase gene from the
T-DNA of the Ti plasmid of Agrobacterium tumefaciens.

The particle bombardment method (Klein et al. (1987) Nature 327:70-73) may be
used to transfer genes to the callus culture cells. According to this method, gold particles
(1 µm in diameter) are coated with DNA using the following technique. Ten µg of
plasmid DNAs are added to 50 µL of a suspension of gold particles (60 mg per mL).
Calcium chloride (50 µL of a 2.5 M solution) and spermidine free base (20 µL of a 1.0 M
solution) are added to the particles. The suspension is vortexed during the addition of
these solutions. After 10 minutes, the tubes are briefly centrifuged (5 sec at 15,000 rpm)
and the supernatant removed. The particles are resuspended in 200 µL of absolute
ethanol, centrifuged again and the supernatant removed. The ethanol rinse is performed
again and the particles resuspended in a final volume of 30 µL of ethanol. An aliquot
(5 µL) of the DNA-coated gold particles can be placed in the center of a Kapton flying
disc (Bio-Rad Labs). The particles are then accelerated into the maize tissue with a
Biolistic PDS-1000/He (Bio-Rad Instruments, Hercules CA), using a helium pressure of
1000 psi, a gap distance of 0.5 cm and a flying distance of 1.0 cm.

For bombardment, the embryogenic tissue is placed on filter paper over agarose-solidified
N6 medium. The tissue is arranged as a thin lawn and covers a circular area of
about 5 cm in diameter. The petri dish containing the tissue can be placed in the chamber
of the PDS-1000/He approximately 8 cm from the stopping screen. The air in the chamber
is then evacuated to a vacuum of 28 inches of Hg. The macrocarrier is accelerated with a
helium shock wave using a rupture membrane that bursts when the He pressure in the
shock tube reaches 1000 psi.
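
The quantities given above for particle preparation imply a per-shot load that can be estimated with simple arithmetic. The sketch below assumes that no gold is lost during the washes and that all of the plasmid DNA binds to the particles, so the DNA figure is an upper bound; the function name is hypothetical.

```python
def per_shot_load(gold_suspension_ul=50.0, gold_mg_per_ml=60.0, dna_ug=10.0,
                  final_volume_ul=30.0, aliquot_ul=5.0):
    """Return (gold_mg, dna_ug) delivered per bombardment aliquot.

    Assumes lossless washes and complete DNA binding (upper-bound estimate).
    """
    total_gold_mg = gold_suspension_ul * gold_mg_per_ml / 1000.0
    fraction = aliquot_ul / final_volume_ul
    return total_gold_mg * fraction, dna_ug * fraction


gold_mg, dna_ug_per_shot = per_shot_load()
```

With the volumes stated in the text, each 5 µL aliquot carries one sixth of the preparation, that is about 0.5 mg of gold and at most about 1.7 µg of DNA.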

Seven days after bombardment the tissue can be transferred to N6 medium that
contains glufosinate (2 mg per liter) and lacks casein or proline. The tissue continues to
grow slowly on this medium. After an additional 2 weeks the tissue can be transferred to
fresh N6 medium containing glufosinate. After 6 weeks, areas of about 1 cm in diameter
of actively growing callus can be identified on some of the plates containing the
glufosinate-supplemented medium. These calli may continue to grow when sub-cultured
on the selective medium.

Plants can be regenerated from the transgenic callus by first transferring clusters of
tissue to N6 medium supplemented with 0.2 mg per liter of 2,4-D. After two weeks the
tissue can be transferred to regeneration medium (Fromm et al. (1990) Bio/Technology
8:833-839).

Example 6

This example describes expression of transgenes in dicot cells.

A seed-specific expression cassette composed of the promoter and transcription
terminator from the gene encoding the β subunit of the seed storage protein phaseolin from
the bean Phaseolus vulgaris (Doyle et al. (1986) J. Biol. Chem. 261:9228-9238) can be
used for expression of the instant polypeptides in transformed soybean. The phaseolin
cassette includes about 500 nucleotides upstream (5') from the translation initiation codon
and about 1650 nucleotides downstream (3') from the translation stop codon of phaseolin.
Between the 5' and 3' regions are the unique restriction endonuclease sites Nco I (which
includes the ATG translation initiation codon), SmaI, KpnI and XbaI. The entire cassette
is flanked by Hind III sites.

The cDNA fragment of this gene may be generated by polymerase chain reaction
(PCR) of the cDNA clone using appropriate oligonucleotide primers. Cloning sites can be
incorporated into the oligonucleotides to provide proper orientation of the DNA fragment
when inserted into the expression vector. Amplification is then performed as described
above, and the isolated fragment is inserted into a pUC18 vector carrying the seed
expression cassette.

Soybean embryos may then be transformed with the expression vector comprising
sequences encoding the instant polypeptides. To induce somatic embryos, cotyledons,
3-5 mm in length dissected from surface sterilized, immature seeds of the soybean cultivar
A2872, can be cultured in the light or dark at 26°C on an appropriate agar medium for
6-10 weeks. Somatic embryos which produce secondary embryos are then excised and
placed into a suitable liquid medium. After repeated selection for clusters of somatic
embryos which multiplied as early, globular staged embryos, the suspensions are
maintained as described below.

Soybean embryogenic suspension cultures can be maintained in 35 mL liquid media
on a rotary shaker, 150 rpm, at 26°C with fluorescent lights on a 16:8 hour day/night
schedule. Cultures are subcultured every two weeks by inoculating approximately 35 mg
of tissue into 35 mL of liquid medium.

Soybean embryogenic suspension cultures may then be transformed by the method
of particle gun bombardment (Klein et al. (1987) Nature (London) 327:70-73, U.S. Patent
No. 4,945,050). A Du Pont Biolistic PDS1000/HE instrument (helium retrofit) can be
used for these transformations.

A selectable marker gene which can be used to facilitate soybean transformation is a
transgene composed of the 35S promoter from Cauliflower Mosaic Virus (Odell et
al. (1985) Nature 313:810-812), the hygromycin phosphotransferase gene from plasmid
pJR225 (from E. coli; Gritz et al. (1983) Gene 25:179-188) and the 3' region of the
nopaline synthase gene from the T-DNA of the Ti plasmid of Agrobacterium tumefaciens.
The seed expression cassette comprising the phaseolin 5' region, the fragment encoding the
instant polypeptides and the phaseolin 3' region can be isolated as a restriction fragment.
This fragment can then be inserted into a unique restriction site of the vector carrying the
marker gene.

To 50 µL of a 60 mg/mL 1 µm gold particle suspension is added (in order): 5 µL
DNA (1 µg/µL), 20 µL spermidine (0.1 M), and 50 µL CaCl2 (2.5 M). The particle
preparation is then agitated for three minutes, spun in a microfuge for 10 seconds and the
supernatant removed. The DNA-coated particles are then washed once in 400 µL 70%
ethanol and resuspended in 40 µL of anhydrous ethanol. The DNA/particle suspension can
be sonicated three times for one second each. Five microliters of the DNA-coated gold
particles are then loaded on each macrocarrier disk.
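
The arithmetic behind this coating mix can be sketched as follows. The volumes and stock concentrations are those stated above; the per-disk figures assume, purely for illustration, that no gold is lost in the wash and that all of the DNA stays bound to the particles. The function and key names are hypothetical.

```python
# Coating mix as stated in the text: volume (uL) and stock concentration.
GOLD_UL, GOLD_MG_PER_ML = 50.0, 60.0
DNA_UL, DNA_UG_PER_UL = 5.0, 1.0
SPERMIDINE_UL, SPERMIDINE_M = 20.0, 0.1
CACL2_UL, CACL2_M = 50.0, 2.5

FINAL_ETHANOL_UL = 40.0  # resuspension volume
ALIQUOT_UL = 5.0         # volume loaded per macrocarrier disk


def coating_summary() -> dict:
    """Return mix-level and per-disk quantities for the coating protocol."""
    total_ul = GOLD_UL + DNA_UL + SPERMIDINE_UL + CACL2_UL
    gold_mg = GOLD_MG_PER_ML * GOLD_UL / 1000.0   # mg/mL x mL
    dna_ug = DNA_UG_PER_UL * DNA_UL
    # Assumption: nothing is lost between coating and loading.
    fraction_per_disk = ALIQUOT_UL / FINAL_ETHANOL_UL
    return {
        "total_volume_ul": total_ul,
        "spermidine_mM": SPERMIDINE_M * SPERMIDINE_UL / total_ul * 1000.0,
        "cacl2_M": CACL2_M * CACL2_UL / total_ul,
        "gold_mg": gold_mg,
        "dna_ug": dna_ug,
        "gold_mg_per_disk": gold_mg * fraction_per_disk,
        "dna_ug_per_disk": dna_ug * fraction_per_disk,
    }


if __name__ == "__main__":
    for key, value in coating_summary().items():
        print(f"{key}: {value:.3f}")
```

Under these assumptions each 5 µL aliquot carries one eighth of the preparation, so the 40 µL suspension is enough for eight disks.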
Approximately 300-400 mg of a two-week-old suspension culture is placed in an
empty 60x15 mm petri dish and the residual liquid removed from the tissue with a pipette.
For each transformation experiment, approximately 5-10 plates of tissue are normally
bombarded. Membrane rupture pressure is set at 1100 psi and the chamber is evacuated to
a vacuum of 28 inches mercury. The tissue is placed approximately 3.5 inches away from
the retaining screen and bombarded three times. Following bombardment, the tissue can
be divided in half and placed back into liquid and cultured as described above.

Five to seven days post bombardment, the liquid media may be exchanged with fresh
media, and eleven to twelve days post bombardment with fresh media containing
50 mg/mL hygromycin. This selective media can be refreshed weekly. Seven to
eight weeks post bombardment, green, transformed tissue may be observed growing from
untransformed, necrotic embryogenic clusters. Isolated green tissue is removed and
inoculated into individual flasks to generate new, clonally propagated, transformed
embryogenic suspension cultures. Each new line may be treated as an independent
transformation event. These suspensions can then be subcultured and maintained as
clusters of immature embryos or regenerated into whole plants by maturation and
germination of individual somatic embryos.

Example 7 

This example describes expression of a transgene in microbial cells.

The cDNAs encoding the instant polypeptides can be inserted into the T7 E. coli
expression vector pBT430. This vector is a derivative of pET-3a (Rosenberg et al. (1987)
Gene 56:125-135) which employs the bacteriophage T7 RNA polymerase/T7 promoter
system. Plasmid pBT430 was constructed by first destroying the EcoR I and Hind III sites
in pET-3a at their original positions. An oligonucleotide adaptor containing EcoR I and
Hind III sites was inserted at the BamH I site of pET-3a. This created pET-3aM with
additional unique cloning sites for insertion of genes into the expression vector. Then, the
Nde I site at the position of translation initiation was converted to an Nco I site using
oligonucleotide-directed mutagenesis. The DNA sequence of pET-3aM in this region,
5'-CATATGG, was converted to 5'-CCCATGG in pBT430.
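
The effect of this two-base change can be checked with a short sketch. The recognition sequences used are the standard ones for Nde I (CATATG) and Nco I (CCATGG); the helper name is illustrative and the two seven-base strings are the only sequence data taken from the text.

```python
# Standard recognition sequences for the two enzymes.
NDE_I_SITE = "CATATG"
NCO_I_SITE = "CCATGG"


def has_site(sequence: str, site: str) -> bool:
    """Return True if the recognition site occurs in the sequence."""
    return site in sequence.upper()


PET3AM_REGION = "CATATGG"  # region in pET-3aM, from the text
PBT430_REGION = "CCCATGG"  # same region in pBT430, from the text

if __name__ == "__main__":
    for name, region in (("pET-3aM", PET3AM_REGION), ("pBT430", PBT430_REGION)):
        print(
            name,
            "Nde I:", has_site(region, NDE_I_SITE),
            "Nco I:", has_site(region, NCO_I_SITE),
            "ATG offset:", region.find("ATG"),
        )
```

The ATG initiation codon sits at the same offset in both strings, which is consistent with the mutagenesis changing only the bases upstream of the start codon.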

Plasmid DNA containing a cDNA may be appropriately digested to release a nucleic
acid fragment encoding the protein. This fragment may then be purified on a 1% NuSieve
GTG low melting agarose gel (FMC). Buffer and agarose contain 10 µg/ml ethidium
bromide for visualization of the DNA fragment. The fragment can then be purified from
the agarose gel by digestion with GELase (Epicentre Technologies) according to the
manufacturer's instructions, ethanol precipitated, dried and resuspended in 20 µL of water.
Appropriate oligonucleotide adapters may be ligated to the fragment using T4 DNA ligase
(New England Biolabs, Beverly, MA). The fragment containing the ligated adapters can
be purified from the excess adapters using low melting agarose as described above. The
vector pBT430 is digested, dephosphorylated with alkaline phosphatase (NEB) and
deproteinized with phenol/chloroform as described above. The prepared vector pBT430 
and fragment can then be ligated at for 15 hours followed by transformation into 
DH5 electrocompetent cells (GIBCO BRL). Transformants can be selected on agar plates
containing LB media and 100 µg/mL ampicillin. Transformants containing the gene
encoding the instant polypeptides are then screened for the correct orientation with respect
to the T7 promoter by restriction enzyme analysis.

For high level expression, a plasmid clone with the cDNA insert in the correct
orientation relative to the T7 promoter can be transformed into E. coli strain BL21(DE3)
(Studier et al. (1986) J. Mol. Biol. 189:113-130). Cultures are grown in LB medium
containing ampicillin (100 mg/L) at 25°C. At an optical density at 600 nm of
approximately 1, IPTG (isopropylthio-β-galactoside, the inducer) can be added to a final
concentration of 0.4 mM and incubation can be continued for 3 h at 25°C. Cells are then
harvested by centrifugation and re-suspended in 50 µL of 50 mM Tris-HCl at pH 8.0
containing 0.1 mM DTT and 0.2 mM phenyl methylsulfonyl fluoride. A small amount of
1 mm glass beads can be added and the mixture sonicated 3 times for about 5 seconds each
time with a microprobe sonicator. The mixture is centrifuged and the protein
concentration of the supernatant determined. One microgram of protein from the soluble
fraction of the culture can be separated by SDS-polyacrylamide gel electrophoresis. Gels
can be observed for protein bands migrating at the expected molecular weight.
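
A rough expected molecular weight can be estimated from the sequence listing before running the gel. The sketch below derives the residue count from a CDS range (in this listing the stated ranges end at the last sense codon, so the stop codon is not counted) and multiplies by an average residue mass of about 110 Da. That average is a common rule of thumb and is an assumption here, not a value from this document; a mass computed from the actual composition will differ somewhat.

```python
AVG_RESIDUE_MASS_DA = 110.0  # rule-of-thumb average residue mass (assumption)


def cds_residue_count(start: int, end: int) -> int:
    """Number of residues encoded by a CDS range that excludes the stop codon."""
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError("CDS range must span a positive multiple of three bases")
    return length // 3


def approx_mass_kda(residues: int) -> float:
    """Approximate polypeptide mass in kDa from the residue count."""
    return residues * AVG_RESIDUE_MASS_DA / 1000.0


if __name__ == "__main__":
    # CDS ranges as given in the sequence listing.
    for label, start, end in (("SEQ ID NO: 1", 236, 1543), ("SEQ ID NO: 13", 1, 1107)):
        n = cds_residue_count(start, end)
        print(f"{label}: {n} residues, roughly {approx_mass_kda(n):.1f} kDa")
```

Any fusion tag or vector-derived residues would add to these estimates.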

The above examples are provided to illustrate the invention but not to limit its
scope. Other variants of the invention will be readily apparent to one of ordinary skill in
the art and are encompassed by the appended claims. All publications, patents, patent
applications, and computer programs cited herein are hereby incorporated by reference.

Deposits 

Plasmids containing polynucleotide sequences of the invention were deposited on
March 21, 2000, with the American Type Culture Collection (ATCC), 10801 University
Boulevard, Manassas, Virginia USA, 20110-2209, and assigned Accession Nos. PTA-
1539, PTA-1540, and PTA-1541. These deposits will be maintained under the terms of the
Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the
Purposes of Patent Procedure. In addition, during the pendency of this patent application,
access to the deposited cultures will be available to the Commissioner of Patents and
Trademarks and to persons determined by the Commissioner to be entitled thereto under
37 C.F.R. §1.14 and 35 U.S.C. §122.

These deposits were made merely as a convenience for those of skill in the art and
are not an admission that a deposit is required under 35 U.S.C. § 112. All restrictions
imposed by the depositor on the availability to the public of the deposited material will be
irrevocably removed upon granting of a patent. However, it should be understood that the
availability of a deposit does not constitute a license to practice the subject invention in
derogation of patent rights granted by government action.
WHAT IS CLAIMED IS:

1. An isolated nucleic acid comprising a member selected from the group
consisting of:

(a) a polynucleotide having at least 80% sequence identity, as determined by the
GAP algorithm under default parameters, to a polynucleotide of SEQ ID NO: 1,
5, 9, 13, 17, 21, 25, 29, 33, 37, 41, 45, 49, 53, 57, 61, or 65;

(b) a polynucleotide encoding a polypeptide of SEQ ID NO: 2, 6, 10, 14, 18, 22,
26, 30, 34, 38, 42, 46, 50, 54, 58, 62, or 66;

(c) a polynucleotide amplified from a Zea mays nucleic acid library using primers
which selectively hybridize, under stringent hybridization conditions, to loci
within a polynucleotide of SEQ ID NO: 1, 5, 9, 13, 17, 21, 25, 29, 33, 37, 41,
45, 49, 53, 57, 61, or 65;

(d) a polynucleotide which selectively hybridizes, under stringent hybridization
conditions and a wash in 0.1X SSC at 60°C, to a polynucleotide of SEQ ID
NO: 1, 5, 9, 13, 17, 21, 25, 29, 33, 37, 41, 45, 49, 53, 57, 61, or 65;

(e) a polynucleotide of SEQ ID NO: 1, 5, 9, 13, 17, 21, 25, 29, 33, 37, 41, 45, 49,
53, 57, 61, or 65;

(f) a polynucleotide which is complementary to a polynucleotide of (a), (b), (c),
(d), or (e); and

(g) a polynucleotide comprising at least 25 contiguous nucleotides from a
polynucleotide of (a), (b), (c), (d), (e), or (f).

2. A recombinant expression cassette, comprising a member of claim 1 operably
linked, in sense or anti-sense orientation, to a promoter.

3. A host cell comprising the recombinant expression cassette of claim 2.

4. A transgenic plant comprising a recombinant expression cassette of claim 2.

5. The transgenic plant of claim 4, wherein said plant is a monocot.

6. The transgenic plant of claim 4, wherein said plant is a dicot.

7. The transgenic plant of claim 4, wherein said plant is selected from the group
consisting of: maize, soybean, sunflower, sorghum, canola, wheat, alfalfa, cotton, rice,
barley, millet, peanut, and cocoa.

8. A transgenic seed from the transgenic plant of claim 4.

9. A method of modulating the level of signal transduction gene expression in a
plant cell, comprising:

(a) introducing into a plant cell a recombinant expression cassette comprising a
polynucleotide of claim 1 operably linked to a promoter;

(b) culturing the plant cell under plant cell growing conditions; and

(c) inducing expression of said polynucleotide for a time sufficient to modulate the
level of signal transduction gene expression in said plant cell.

10. The method of claim 9, wherein the plant cell is from maize, wheat, rice, or
soybean.

11. A method of modulating the level of signal transduction gene expression in a
plant, comprising:

(a) introducing into a plant cell a recombinant expression cassette comprising a
polynucleotide of claim 1 operably linked to a promoter;

(b) culturing the plant cell under plant cell growing conditions;

(c) regenerating a plant from said plant cell; and

(d) inducing expression of said polynucleotide for a time sufficient to modulate the
level of signal transduction gene expression in said plant.

12. The method of claim 11, wherein the plant is maize, wheat, rice, or soybean.

13. An isolated protein comprising a member selected from the group consisting
of:

(a) a polypeptide of at least 20 contiguous amino acids from a polypeptide of SEQ
ID NO: 2, 6, 10, 14, 18, 22, 26, 30, 34, 38, 42, 46, 50, 54, 58, 62, or 66;

(b) a polypeptide of SEQ ID NO: 2, 6, 10, 14, 18, 22, 26, 30, 34, 38, 42, 46, 50, 54,
58, 62, or 66;

(c) a polypeptide having at least 80% sequence identity to, and having at least one
epitope in common with, a polypeptide of SEQ ID NO: 2, 6, 10, 14, 18, 22, 26,
30, 34, 38, 42, 46, 50, 54, 58, 62, or 66, wherein said sequence identity is
determined by the GAP algorithm under default parameters; and,

(d) at least one polypeptide encoded by a member of claim 1.

SEQUENCE LISTING

<110> Pioneer Hi-Bred International, Inc.

<120> Signal Transduction Genes and Methods of Use

<130> 1083-PCT

<150> US 60/134,292
<151> 1999-05-14

<150> US 60/142,996
<151> 1999-07-08

<160> 69

<170> FastSEQ for Windows Version 3.0

<210> 1
<211> 2029
<212> DNA
<213> Zea mays

<220>
<221> CDS
<222> (236)...(1543)

<400> 1

gtcgacccac gcgtccggaa gactgagtaa tgtagatttc tcatctgctg caatctctat 60
ccaagatacc aatgtatggt cagcacggtc tctctggtct tctgatgcga tgggatcttc 120
acctccttgt gattcacctg acaagtttgc aggtggtcag gagcgttctc ctctctctag 180
tccactgaga agtccagttt taaaatcaag aaacccaagt gcacctccat caeca atg 238
Met
1

cat cca aag ttg ttc ccg gag aac cat gtt tct cgc cct gat ggc aat 286
His Pro Lys Leu Phe Pro Glu Asn His Val Ser Arg Pro Asp Gly Asn
5 10 15

gga agt grt agt ttt cac cca tta ccg ctc cct cct gct tct gta agc 334
Gly Ser Aicj Ser Phe His Pro Leu Pro Leu Pro Pro Ala Ser Val Ser
20 25 30

cca aag cag aca aat gct agc cac cag tta gtt cca aaa gct gag atg 382
Pro Lys Gln Thr Asn Ala Ser His Gln Leu Val Pro Lys Ala Glu Met
35 40 45

cct tcg gtg gct ggt caa tgg cag aaa gga aaa ctc ttg ggt agt ggc 430
Pro Ser Val Ala Gly Gln Trp Gln Lys Gly Lys Leu Leu Gly Ser Gly
50 55 60 65

acc ttt gga tgc gtg tat gag gcc acc aat agg cac act gga gct ctt 478
Thr Phe Gly Cys Val Tyr Glu Ala Thr Asn Arg His Thr Gly Ala Leu
70 75 80

tgt gcc atg aaa gag gtg aat ata att cca gat gat gct aaa tca gtc 526
Cys Ala Met Lys Glu Val Asn Ile Ile Pro Asp Asp Ala Lys Ser Val
85 90 95

gag tct ctg aag caa ttg gaa cag gaa att aag ttt ctt agt caa ttt 574
Glu Ser Leu Lys Gln Leu Glu Gln Glu Ile Lys Phe Leu Ser Gln Phe
100 105 110

aag cat gaa aat att gtg cag tac tat ggc agt gaa act atc gaa gat 622
Lys His Glu Asn Ile Val Gln Tyr Tyr Gly Ser Glu Thr Ile Glu Asp
115 120 125

aga ttc tac ata tac ctg gag tat gtt cat cct ggt tca atc cat aag 670
Arg Phe Tyr Ile Tyr Leu Glu Tyr Val His Pro Gly Ser Ile His Lys
130 135 140 145

tat gtt cat caa cac tgt gga tcg ctg aca gaa gca gtc atc cgt aac 718
Tyr Val His Gln His Cys Gly Ser Leu Thr Glu Ala Val Ile Arg Asn
150 155 160

ttc aca cgt cat att ctc aag ggt ctt gct ttt tta cat agc caa aag 766
Phe Thr Arg His Ile Leu Lys Gly Leu Ala Phe Leu His Ser Gln Lys
165 170 175

att atg cat agg gat atc aaa ggg gca aat ttg tta gtt gat att aac 814
Ile Met His Arg Asp Ile Lys Gly Ala Asn Leu Leu Val Asp Ile Asn
180 185 190

ggt gta gtg aaa ttg gct gac ttc gga atg gca aag cat ttg agt act 862
Gly Val Val Lys Leu Ala Asp Phe Gly Met Ala Lys His Leu Ser Thr
195 200 205

gca gct cct aat ctt tca ttg aag gga aca cca tac tgg atg gct cct 910
Ala Ala Pro Asn Leu Ser Leu Lys Gly Thr Pro Tyr Trp Met Ala Pro
210 215 220 225

gag gtt gtt cgg gct aca ctc gat aaa agt gca ggt tat gat ctt gct 958
Glu Val Val Arg Ala Thr Leu Asp Lys Ser Ala Gly Tyr Asp Leu Ala
230 235 240

gtt gat att tgg agc ctt ggc tgc aca atc att gag atg ttt aca gga 1006
Val Asp Ile Trp Ser Leu Gly Cys Thr Ile Ile Glu Met Phe Thr Gly
245 250 255

aag cca cct tgg agt ggt ctt gaa ggg ccg gct gca atg ttt aag gtg 1054
Lys Pro Pro Trp Ser Gly Leu Glu Gly Pro Ala Ala Met Phe Lys Val
260 265 270

ttg cgc acg gac cca cca att cca gat aac ctg tcc cct gaa ggg aag 1102
Leu Arg Thr Asp Pro Pro Ile Pro Asp Asn Leu Ser Pro Glu Gly Lys
275 280 285

gat ttt ctg aga tgc tgc ttc aag aga aac cct act gag cga cca aca 1150
Asp Phe Leu Arg Cys Cys Phe Lys Arg Asn Pro Thr Glu Arg Pro Thr
290 295 300 305

gca aac aag ttg ctc gaa cac cca ttt atc caa act ttg aac cac tac 1198
Ala Asn Lys Leu Leu Glu His Pro Phe Ile Gln Thr Leu Asn His Tyr
310 315 320

agc cca cat agc gtt gta cat tca ttt tcg gga att aaa tct cca gat 1246
Ser Pro His Ser Val Val His Ser Phe Ser Gly Ile Lys Ser Pro Asp
325 330 335

act gtg cac agc agt gca aga gat aaa gta ccc tgg aaa agt gat tca 1294
Thr Val His Ser Ser Ala Arg Asp Lys Val Pro Trp Lys Ser Asp Ser
340 345 350 

tat atg aga ggg aaa cat gca aat ggt gaa aca agc agt gct cgg tcg 1342
Tyr Met Arg Gly Lys His Ala Asn Gly Glu Thr Ser Ser Ala Arg Ser
355 360 365

tca gag tca ttg gct tat cgg ttg atg aca ccc cta ccc aac ttg gga 1390
Ser Glu Ser Leu Ala Tyr Arg Leu Met Thr Pro Leu Pro Asn Leu Gly
370 375 380 385

acc aat agt ttg tca cct cca cct ttg agt tcg gcc tcc agt tct ggc 1438
Thr Asn Ser Leu Ser Pro Pro Pro Leu Ser Ser Ala Ser Ser Ser Gly
390 395 400

tct gca gca cat aca ccg aac agt atg cat ttt tca gtc cca tac cat 1486
Ser Ala Ala His Thr Pro Asn Ser Met His Phe Ser Val Pro Tyr His
405 410 415

cag cct agt cct ttg cca aag cct aac ggg aag gaa gca ata aat ttg 1534
Gln Pro Ser Pro Leu Pro Lys Pro Asn Gly Lys Glu Ala Ile Asn Leu
420 425 430

gtt tcg cat tgaataaeag catgccacct atcacaacat ctgggcaaag
Val Ser His
435
tccaeataaa
gttgatgctg
aataatgcac
aaaaaaaaaa

cagcgcaagt
tttctgggtc
aaatgctgct
tgattggatt
tcaagattgt
tgtaatgatc
gtcaattggc
aaaaaagggc

gtatccgtce
aaatgaggga
gcccatgtaa
atacttgatg
ctagcgtctg
aatttccatg
gctttgtgaa
catattttta
aggacattag
taatcattat
acccttgtac
taaagagatc
atctgttgta
ttagggtgta
ggttgtagct
tgattttttt
aaaaaaaaaa

tgccagtaat
accgcgatat
ccgtttgatt
gctgtaggtg
gtagcgtgtt
tggggaaggt
aaaaaaaaaa

1643
1703
1763
1823
1883
1943
2003
2029

cgcgccgccc ccccacactc cgcccgtgcz gtccarctgc ctgaatatat tctttttcgt 120
tgtcatgtgt gccgtgggaa ttgctccart cgactggcaa gtgcccgtcc ctgaccatgc 180
agcctctcat ccggactgga ttggttccgg crgrcgctcg ccccaggaag aacactggac 240

atg gac ttc ttc acg gaa tat ggg gag gcg agt cag tat caa atc caa 288
Met Asp Phe Phe Thr Glu Tyr Gly Glu Ala Ser Gln Tyr Gln Ile Gln
1 5 10 15

gag gtc att ggt aag gga agt tat gga gta gzz gct gct gca att gat 336
Glu Val Ile Gly Lys Gly Ser Tyr Gly Val Val Ala Ala Ala Ile Asp
20 25 30

agt cac act ggg gag cgg gtt gcg atc aag aag ata aat gat att ttt 384
Ser His Thr Gly Glu Arg Val Ala Ile Lys Lys Ile Asn Asp Ile Phe
35 40 45

ggt aat gtc tcg gat gct gct cgc att ctg cgg gaa atc aag ctt ctt 432
Gly Asn Val Ser Asp Ala Ala Arg Ile Leu Arg Glu Ile Lys Leu Leu
50 55 60

cgt ctg ctt cgt cac cca aac ata gtc cag att aag cac att atg ttg 480
Arg Leu Leu Arg His Pro Asn Ile Val Gln Ile Lys His Ile Met Leu
65 70 75 80

ccc cct acc cga agg gag ttc aaa gat atc tat gtt gtt ttt gag ctc 528
Pro Pro Thr Arg Arg Glu Phe Lys Asp Ile Tyr Val Val Phe Glu Leu
85 90 95

atg gag tct gac ctg cat caa gtg atc aaa gca aat gat aac ctg act 576
Met Glu Ser Asp Leu His Gln Val Ile Lys Ala Asn Asp Asn Leu Thr
100 105 110

cca gag cat cac cgc ttt ttc ctg tat caa ctt att cgt gct ctc aag 624
Pro Glu His His Arg Phe Phe Leu Tyr Gln Leu Ile Arg Ala Leu Lys
115 120 125

tac atg cat tca gcc cat gta ttt cat cgt gat tta aag ccc agg aac 672
Tyr Met His Ser Ala His Val Phe His Arg Asp Leu Lys Pro Arg Asn
130 135 140

ata ctg gcc aat tca gac agc aaa cta aag atc tgc gac ttt gga ctt 720
Ile Leu Ala Asn Ser Asp Ser Lys Leu Lys Ile Cys Asp Phe Gly Leu
145 150 155 160

gca cgt gca tca ttt aac gac tct ctt tca gct atc tat tgg aca gat 768
Ala Arg Ala Ser Phe Asn Asp Ser Leu Ser Ala Ile Tyr Trp Thr Asp
165 170 175

tat gtc gca aca agg tgg tac cgg gct cct gaa tta tgt ggc tcc ttt 816
Tyr Val Ala Thr Arg Trp Tyr Arg Ala Pro Glu Leu Cys Gly Ser Phe
180 185 190

ttc tcc agt tac acc cct gca att gat att tgg agt ata ggg tgc att 864
Phe Ser Ser Tyr Thr Pro Ala Ile Asp Ile Trp Ser Ile Gly Cys Ile
195 200 205

ttt gct gaa gtt ctc acc gga acg cca ttg ttt cct ggg agg aat gtt 912
Phe Ala Glu Val Leu Thr Gly Thr Pro Leu Phe Pro Gly Arg Asn Val
210 215 220

gtg cac caa tta gac tta ata aca gat ctc ctt gga aca cca tcg ttt 960
Val His Gln Leu Asp Leu Ile Thr Asp Leu Leu Gly Thr Pro Ser Phe
225 230 235 240

aga tcc tta tcc cag att cac agt gat aag gcc agg gaa tat ttg ctt 1008
Arg Ser Leu Ser Gln Ile His Ser Asp Lys Ala Arg Glu Tyr Leu Leu
245 250 255

ggc atg ccg agg aag cgc cct att cca ttt tct cat aag ttc cgt aat 1056
Gly Met Pro Arg Lys Arg Pro Ile Pro Phe Ser His Lys Phe Arg Asn
260 265 270

gcc gat cct ctg gct ctc cgg ttg cta gag cgt tta ctt gca ttt gac 1104
Ala Asp Pro Leu Ala Leu Arg Leu Leu Glu Arg Leu Leu Ala Phe Asp
275 280 285

ccg aaa gat cga cct act gct gaa gag gct tta gct gat cca tac ttc 1152
Pro Lys Asp Arg Pro Thr Ala Glu Glu Ala Leu Ala Asp Pro Tyr Phe
290 295 300

agt ggc ctt tct aaa ttg gaa ctt gag cct tca gca caa cca att ttg 1200
Ser Gly Leu Ser Lys Leu Glu Leu Glu Pro Ser Ala Gln Pro Ile Leu
305 310 315 320

aag gtt gac ttt gaa ttc gag gga agg aaa cta aca aag gct ggt gtg 1248
Lys Val Asp Phe Glu Phe Glu Gly Arg Lys Leu Thr Lys Ala Gly Val
325 330 335

aga gaa atg ata tat cgg gag att ttg gag tac cat ccc cag atg ctt 1296
Arg Glu Met Ile Tyr Arg Glu Ile Leu Glu Tyr His Pro Gln Met Leu
340 345 350

cag gag tac atc gaa ggt gga gat cag att cac ttc cta tac cca agt 1344
Gln Glu Tyr Ile Glu Gly Gly Asp Gln Ile His Phe Leu Tyr Pro Ser
355 360 365

ggg gtt gat cgc ttc cag cga cag ttt gct cat ctc gag gag aac tac 1392
Gly Val Asp Arg Phe Gln Arg Gln Phe Ala His Leu Glu Glu Asn Tyr
370 375 380

^99 ^9^ gga gta gct agc acc ccc ccg cga agg cag ccc aca tct ctg 1440
Arg Arg Gly Val Ala Ser Thr Pro Pro Arg Arg Gln Pro Thr Ser Leu
385 390 395 400

ccg agg gag cga gtc tgc tca tca gaa gat ggc cac agt cag gac tct 1488
Pro Arg Glu Arg Val Cys Ser Ser Glu Asp Gly His Ser Gln Asp Ser
405 410 415

gac agc gaa gag cag aga gca gca tcc tac gtt gcc cga aca acc ata 1536
Asp Ser Glu Glu Gln Arg Ala Ala Ser Tyr Val Ala Arg Thr Thr Ile
420 425 430

agc cct cca aga tca caa gag gag ggc ggc aag ctc cag cct gcc tac 1584
Ser Pro Pro Arg Ser Gln Glu Glu Gly Gly Lys Leu Gln Pro Ala Tyr
435 440 445

cag agc gcc gcc gcc gcc gca gac tcc tgc gcc aag agc tat ctg aag 1632
Gln Ser Ala Ala Ala Ala Ala Asp Ser Cys Ala Lys Ser Tyr Leu Lys
450 455 460

ggt gcc gct agt atc agt agt gct tcc agg cgt acc atc aag ggg gat 1680
Gly Ala Ala Ser Ile Ser Ser Ala Ser Arg Arg Thr Ile Lys Gly Asp
465 470 475 480

aat ggc cga aag gag aag gga tct ccg agg ctg tgaaagaggt ggcgcaggct 1733
Asn Gly Arg Lys Glu Lys Gly Ser Pro Arg Leu
485 490

cgtgacctgt taggcaaaag aggtagcaac ggcgccaatg cgactgcaac gtgagatgtc 1793
ttagtcatca cgcatccaac aatgcaggag cacggagcac cacctcaccc cccccccccc 1853
ccccccagta caagtatgca ggtcatccta gcraggtcgc ccaaattaaa gcagcctgct 1913
tggaggaatt agatttgtgt actgttaagt ttrttttrtt ggaattcaac tgctagattc 1973
tggcagatct gtcaaccgca ctcgccttgg tt::cccaaaa tttcctggtt tgaacaagtt 2033
taaaaaaaaa aaaaaaaaaa gggcggccgc 2063

atg aag gga aaa ggg ggg cct gac aat act caa tgt gga tac cgt gga 48
Met Lys Gly Lys Gly Gly Pro Asp Asn Thr Gln Cys Gly Tyr Arg Gly
1 5 10 15

gtg agg cag cgt act tgg ggg aag tgg gtt gct gaa ata aga gag cca 96
Val Arg Gln Arg Thr Trp Gly Lys Trp Val Ala Glu Ile Arg Glu Pro
20 25 30

aat cgt gtc gac aga ctc tgg ctg ggt acc ttc cca acc gcg gag gat 144
Asn Arg Val Asp Arg Leu Trp Leu Gly Thr Phe Pro Thr Ala Glu Asp
35 40 45

gca gct agg gcc tat gat gag gca gcc aga gcg atg tat gga gac ttg 192
Ala Ala Arg Ala Tyr Asp Glu Ala Ala Arg Ala Met Tyr Gly Asp Leu
50 55 60

gca cgg act aac ttc ccc gga cag gat gca aca acc tct gcc caa gct 240
Ala Arg Thr Asn Phe Pro Gly Gln Asp Ala Thr Thr Ser Ala Gln Ala
65 70 75 80

gct cta gca tcg acc tct gcc cag gct gan cca aca gct gtc gaa gct 288
Ala Leu Ala Ser Thr Ser Ala Gln Ala Asp Pro Thr Ala Val Glu Ala
85 90 95

ctt cag act ggc acg tca tgc gag tcg aca acg aca tca aat tac tcg 336
Leu Gln Thr Gly Thr Ser Cys Glu Ser Thr Thr Thr Ser Asn Tyr Ser
100 105 110

gac atc gca tcc acc tca cac aag cct gag cct gaa gcc tct gac atc 384
Asp Ile Ala Ser Thr Ser His Lys Pro Glu Pro Glu Ala Ser Asp Ile
115 120 125

tcg agc tcc cta aag gca aaa tgt cca gct gga tca tgt ggt atc caa 432
Ser Ser Ser Leu Lys Ala Lys Cys Pro Ala Gly Ser Cys Gly Ile Gln
130 135 140

gac ggt aca ccc agt gta gct gac aag gag gtc ttt ggg ccg ttg gag 480
Asp Gly Thr Pro Ser Val Ala Asp Lys Glu Val Phe Gly Pro Leu Glu
145 150 155 160

cct atc aca aat ctt cca gat ggt ggt gat ggt ttt gat atc ggt gag 528
Pro Ile Thr Asn Leu Pro Asp Gly Gly Asp Gly Phe Asp Ile Gly Glu
165 170 175

atg ctg agg atg atg gaa agc gat cca cat aat gca ggt gga gct gac 576
Met Leu Arg Met Met Glu Ser Asp Pro His Asn Ala Gly Gly Ala Asp
180 185 190

gct ggc atg ggg cag ccc tgg tat ctt gat gag ctg gat tcg agt gtc 624
Ala Gly Met Gly Gln Pro Trp Tyr Leu Asp Glu Leu Asp Ser Ser Val
195 200 205

ttg gag agc atg ctc cag cca gag cca gag cca gag cca gag cca ttc 672
Leu Glu Ser Met Leu Gln Pro Glu Pro Glu Pro Glu Pro Glu Pro Phe
210 215 220

ctg atg tct gaa gaa ccg gac atg ttc cct gct ggc ttc gaa agc gct 720
Leu Met Ser Glu Glu Pro Asp Met Phe Leu Ala Gly Phe Glu Ser Ala
225 230 235 240

ggt ttc gtc gag ggt ctg gag cgg cta aac tga 753
Gly Phe Val Glu Gly Leu Glu Arg Leu Asn
245 250



<210> 10
<211> 250
<212> PRT
<213> Zea mays

<400> 10

Met Lys Gly Lys Gly Gly Pro Asp Asn Thr Gln Cys Gly Tyr Arg Gly
1 5 10 15

Val Arg Gln Arg Thr Trp Gly Lys Trp Val Ala Glu Ile Arg Glu Pro
20 25 30

Asn Arg Val Asp Arg Leu Trp Leu Gly Thr Phe Pro Thr Ala Glu Asp
35 40 45

Ala Ala Arg Ala Tyr Asp Glu Ala Ala Arg Ala Met Tyr Gly Asp Leu
50 55 60

Ala Arg Thr Asn Phe Pro Gly Gln Asp Ala Thr Thr Ser Ala Gln Ala
65 70 75 80

Ala Leu Ala Ser Thr Ser Ala Gln Ala Asp Pro Thr Ala Val Glu Ala
85 90 95

Leu Gln Thr Gly Thr Ser Cys Glu Ser Thr Thr Thr Ser Asn Tyr Ser
100 105 110

Asp Ile Ala Ser Thr Ser His Lys Pro Glu Pro Glu Ala Ser Asp Ile
115 120 125

Ser Ser Ser Leu Lys Ala Lys Cys Pro Ala Gly Ser Cys Gly Ile Gln
130 135 140

Asp Gly Thr Pro Ser Val Ala Asp Lys Glu Val Phe Gly Pro Leu Glu
145 150 155 160

Pro Ile Thr Asn Leu Pro Asp Gly Gly Asp Gly Phe Asp Ile Gly Glu
165 170 175

Met Leu Arg Met Met Glu Ser Asp Pro His Asn Ala Gly Gly Ala Asp
180 185 190

Ala Gly Met Gly Gln Pro Trp Tyr Leu Asp Glu Leu Asp Ser Ser Val
195 200 205

Leu Glu Ser Met Leu Gln Pro Glu Pro Glu Pro Glu Pro Glu Pro Phe
210 215 220

Leu Met Ser Glu Glu Pro Asp Met Phe Leu Ala Gly Phe Glu Ser Ala
225 230 235 240

Gly Phe Val Glu Gly Leu Glu Arg Leu Asn
245 250

<210> 11
<211> 20
<212> DNA
<213> Zea mays

<400> 11
atgaagggaa aaggggggcc 20

<210> 12
<211> 20
<212> DNA
<213> Zea mays

<400> 12
tcagtttagc cgctccagac 20
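
The two 20-mers above can be compared with the coding sequence listed immediately before SEQ ID NO: 10. The sketch below is an illustrative check only: the CDS fragments are transcribed by hand from that listing (with the codons for Arg and Leu near the 3' end read as cgg and cta, which is an assumption), and the helper name is hypothetical.

```python
_COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a lowercase/uppercase DNA string."""
    return seq.lower().translate(_COMPLEMENT)[::-1]


SEQ_ID_11 = "atgaagggaaaaggggggcc"  # from the listing
SEQ_ID_12 = "tcagtttagccgctccagac"  # from the listing

# Hand-transcribed ends of the 753-base coding sequence (assumed transcription).
CDS_5_PRIME = "atgaagggaaaaggggggcctgac"           # atg aag gga aaa ggg ggg cct gac
CDS_3_PRIME = "ggtttcgtcgagggtctggagcggctaaactga"  # ... ggt ctg gag cgg cta aac tga

if __name__ == "__main__":
    # SEQ ID NO: 11 reads in the sense direction from the ATG.
    print("forward primer matches 5' end:", CDS_5_PRIME.startswith(SEQ_ID_11))
    # SEQ ID NO: 12 is compared as the reverse complement of the 3' end.
    print("reverse primer matches 3' end:",
          CDS_3_PRIME.endswith(reverse_complement(SEQ_ID_12)))
```

If both checks hold, the pair is consistent with primers spanning the coding sequence from the initiation codon through the stop codon.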

<210> 13
<211> 1110
<212> DNA
<213> Zea mays

<220>
<221> CDS
<222> (1)...(1107)

<400> 13

atg gcg atg atg gtg gat ccc ccc aat ggc atg gca agt caa gga aag 48 

Met Ala Met Met Val Asp Pro Pro Asn Gly Met Ala Ser Gin Gly Lys 

1 5 10 15 



cat tac tat act atg tgg cag aca ctg ttt gag ate gac acc aag tac 96 

His Tyr Tyr Thr Met Trp Gin Thr Leu Phe Glu He Asp Thr Lys Tyr 
20 25 30 

gtg ccg ate aag ccc ate gga aga gga get tac ggt gtc gtt tgc tea 144 

Val Pro He Lys Pro He Gly Arg Gly Ala Tyr Gly Val Val Cys Ser 
35 40 45 

tct gtt aac cgc gag acc aat gaa aaa gtc gcg ata aaa aag ata aac 192 
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Ser Val Asn Arg Glu Thr Asn Glu Lys Val Ala He Lys Lys He Asn 
50 55 60 

aac gcc ttt gac aac cgz gtg gat gcg era agg acg ctg aga gag ctg 240 
Asn Val Phe Asp Asn Arg Val Asp Ala Leu Arg Thr Leu Arg Glu Leu 
65 70 75 80 

aaa ctc ctt cgg cac ctg agg cac gag aac gtr att gct ctc aag gac 288
Lys Leu Leu Arg His Leu Arg His Glu Asn Val Ile Ala Leu Lys Asp
85 90 95

ata atg atg cct gcg cat aga agg agc ttc aag gac gtt tac ttg gtt 336
Ile Met Met Pro Ala His Arg Arg Ser Phe Lys Asp Val Tyr Leu Val
100 105 110

tat gag ctc atg gac act gat ctg cat caa ata att aag tca tct caa 384
Tyr Glu Leu Met Asp Thr Asp Leu His Gln Ile Ile Lys Ser Ser Gln
115 120 125

cca cta tcc aat gac cac tgc cag tat ttc ctt ttt cag ctg ctc cga 432
Pro Leu Ser Asn Asp His Cys Gln Tyr Phe Leu Phe Gln Leu Leu Arg
130 135 140

ggc ctg aag tac ctc cat tca gcc ggg ata ctc cac aga gac cta aag 480
Gly Leu Lys Tyr Leu His Ser Ala Gly Ile Leu His Arg Asp Leu Lys
145 150 155 160

cca ggg aac ctc ctg gtc aac gca aac tgt gac ctg aag ata tgc gac 528
Pro Gly Asn Leu Leu Val Asn Ala Asn Cys Asp Leu Lys Ile Cys Asp
165 170 175

ttc ggg ctc gcc cgc acg aac aac acc aag ggc cag ttc atg acg gag 576
Phe Gly Leu Ala Arg Thr Asn Asn Thr Lys Gly Gln Phe Met Thr Glu
180 185 190

tac gtg gtg acc cgc tgg tac agg gca ccc gag ctg ctg ctc tgc tgc 624
Tyr Val Val Thr Arg Trp Tyr Arg Ala Pro Glu Leu Leu Leu Cys Cys
195 200 205

gac aac tac ggc acg tcc att gac gtc tgg tct gtg ggg tgc ata ttc 672
Asp Asn Tyr Gly Thr Ser Ile Asp Val Trp Ser Val Gly Cys Ile Phe
210 215 220

gcg gag ctg ctt ggc cgc aag ccg atc ttc cca gga acc gag tgc ctg 720
Ala Glu Leu Leu Gly Arg Lys Pro Ile Phe Pro Gly Thr Glu Cys Leu
225 230 235 240

aac cag ctc aag ctc atc gtc aac gtc ctc ggc acc atg ggc gag gcc 768
Asn Gln Leu Lys Leu Ile Val Asn Val Leu Gly Thr Met Gly Glu Ala
245 250 255

gac ctc gcg ttc atc gac aac ccg aag gcc cgc aag tac atc aag tcc 816
Asp Leu Ala Phe Ile Asp Asn Pro Lys Ala Arg Lys Tyr Ile Lys Ser
260 265 270

ctt ccg tac gcc ccg ggc gcc ccc ttc acc ggc atg tac cct cag gcg 864
Leu Pro Tyr Ala Pro Gly Ala Pro Phe Thr Gly Met Tyr Pro Gln Ala
275 280 285

cac cct ctc gcc atc gac ctg ctg cag aag atg ctc gtg ttc gac ccg 912
His Pro Leu Ala Ile Asp Leu Leu Gln Lys Met Leu Val Phe Asp Pro
290 295 300

tcc aag agg atc agc gtc acc gag gcg ctg gag cac ccg tac atg tct 960
Ser Lys Arg Ile Ser Val Thr Glu Ala Leu Glu His Pro Tyr Met Ser
305 310 315 320

ccg ctc tat gac ccg agc gcg aac cct ccc gcg cag gtg ccg atc gac 1008
Pro Leu Tyr Asp Pro Ser Ala Asn Pro Pro Ala Gln Val Pro Ile Asp
325 330 335

ctc gac atc gac gag aac ctc ggc gtc gac atg atc agg gag atg atg 1056
Leu Asp Ile Asp Glu Asn Leu Gly Val Asp Met Ile Arg Glu Met Met
340 345 350

tgg cag gag atg atc cac tac cac ccc gag gtc ctc acg aga atc agc 1104
Trp Gln Glu Met Ile His Tyr His Pro Glu Val Leu Thr Arg Ile Ser
355 360 365 

atg tga 1110 
Met 



<210> 14 
<211> 369 
<212> PRT 
<213> Zea mays

<400> 14
Met Ala Met Met Val Asp Pro Pro Asn Gly Met Ala Ser Gln Gly Lys
1 5 10 15

His Tyr Tyr Thr Met Trp Gln Thr Leu Phe Glu Ile Asp Thr Lys Tyr
20 25 30

Val Pro Ile Lys Pro Ile Gly Arg Gly Ala Tyr Gly Val Val Cys Ser
35 40 45

Ser Val Asn Arg Glu Thr Asn Glu Lys Val Ala Ile Lys Lys Ile Asn
50 55 60

Asn Val Phe Asp Asn Arg Val Asp Ala Leu Arg Thr Leu Arg Glu Leu
65 70 75 80

Lys Leu Leu Arg His Leu Arg His Glu Asn Val Ile Ala Leu Lys Asp
85 90 95

Ile Met Met Pro Ala His Arg Arg Ser Phe Lys Asp Val Tyr Leu Val
100 105 110

Tyr Glu Leu Met Asp Thr Asp Leu His Gln Ile Ile Lys Ser Ser Gln
115 120 125

Pro Leu Ser Asn Asp His Cys Gln Tyr Phe Leu Phe Gln Leu Leu Arg
130 135 140

Gly Leu Lys Tyr Leu His Ser Ala Gly Ile Leu His Arg Asp Leu Lys
145 150 155 160

Pro Gly Asn Leu Leu Val Asn Ala Asn Cys Asp Leu Lys Ile Cys Asp
165 170 175

Phe Gly Leu Ala Arg Thr Asn Asn Thr Lys Gly Gln Phe Met Thr Glu
180 185 190

Tyr Val Val Thr Arg Trp Tyr Arg Ala Pro Glu Leu Leu Leu Cys Cys
195 200 205

Asp Asn Tyr Gly Thr Ser Ile Asp Val Trp Ser Val Gly Cys Ile Phe
210 215 220

Ala Glu Leu Leu Gly Arg Lys Pro Ile Phe Pro Gly Thr Glu Cys Leu
225 230 235 240

Asn Gln Leu Lys Leu Ile Val Asn Val Leu Gly Thr Met Gly Glu Ala
245 250 255

Asp Leu Ala Phe Ile Asp Asn Pro Lys Ala Arg Lys Tyr Ile Lys Ser
260 265 270

Leu Pro Tyr Ala Pro Gly Ala Pro Phe Thr Gly Met Tyr Pro Gln Ala
275 280 285

His Pro Leu Ala Ile Asp Leu Leu Gln Lys Met Leu Val Phe Asp Pro
290 295 300

Ser Lys Arg Ile Ser Val Thr Glu Ala Leu Glu His Pro Tyr Met Ser
305 310 315 320

Pro Leu Tyr Asp Pro Ser Ala Asn Pro Pro Ala Gln Val Pro Ile Asp
325 330 335

Leu Asp Ile Asp Glu Asn Leu Gly Val Asp Met Ile Arg Glu Met Met
340 345 350

Trp Gln Glu Met Ile His Tyr His Pro Glu Val Leu Thr Arg Ile Ser
355 360 365

Met

<210> 15 
<211> 20 
<212> DNA 
<213> Zea mays 



<400> 15 

atggcgatga tggtggatcc 20 

<210> 16 
<211> 20 
<212> DNA 
<213> Zea mays 

<400> 16 

tcacatgctg attctcgtga 20 

<210> 17

<211> 1119 

<212> DNA 

<213> Zea mays 



<220> 

<221> CDS 

<222> (1)...(1116)



<400> 17 

atg gat tcc tcc ggc ggc ggc ggt gga gcg cag atc aag ggt atg gcg 48
Met Asp Ser Ser Gly Gly Gly Gly Gly Ala Gln Ile Lys Gly Met Ala
1 5 10 15

acg car' gng g:jc cgc tac gtg ctg tac aac gtg tac gga aac ctc ttc 96
Thr His Gly Gly Arg Tyr Val Leu Tyr Asn Val Tyr Gly Asn Leu Phe
20 25 30

gag gtc tcc tcc aag tac gcc cca ccc atc cgc ccc atc ggt cgc ggc 144
Glu Val Ser Ser Lys Tyr Ala Pro Pro Ile Arg Pro Ile Gly Arg Gly
35 40 45

gcc tac ggc att gtc tgc gcg gct gtc aac tcg cag tca ggg gag gag 192
Ala Tyr Gly Ile Val Cys Ala Ala Val Asn Ser Gln Ser Gly Glu Glu
50 55 60

gtt gcg atc aag aag gtt ggc aat gcg ccc gac aac cac atc gac gcc 240
Val Ala Ile Lys Lys Val Gly Asn Ala Phe Asp Asn His Ile Asp Ala
65 70 75 80

aag cgg acg ctc agg gaa atc aag ctg ctg cgc cac atg gac cat gag 288
Lys Arg Thr Leu Arg Glu Ile Lys Leu Leu Arg His Met Asp His Glu
85 90 95

aac atc ctt gcc tta aag gat gta att cgg ccc cca act aga gag aac 336
Asn Ile Leu Ala Leu Lys Asp Val Ile Arg Pro Pro Thr Arg Glu Asn
100 105 110

ttt aat gac gtg tac att gtt act gag tta atg gat aca gat ctc cat 384
Phe Asn Asp Val Tyr Ile Val Thr Glu Leu Met Asp Thr Asp Leu His
115 120 125

cag atc gta cgc tca aat cag cca ttg act gat gat cat tgc cag tac 432
Gln Ile Val Arg Ser Asn Gln Pro Leu Thr Asp Asp His Cys Gln Tyr
130 135 140

ttc ttg tat cag ttg tta cga ggg cta aaa tat gtg cac tca gca aat 480
Phe Leu Tyr Gln Leu Leu Arg Gly Leu Lys Tyr Val His Ser Ala Asn
145 150 155 160

ata ttg cac cgc gat ctg aag ccg agc aat ttg ttc cta aat gca aat 528
Ile Leu His Arg Asp Leu Lys Pro Ser Asn Leu Phe Leu Asn Ala Asn
165 170 175

tgt gac ctc aag att gca gac ttt ggg ctt gca agg acc act tca gag 576
Cys Asp Leu Lys Ile Ala Asp Phe Gly Leu Ala Arg Thr Thr Ser Glu
180 185 190

aca gat ctc atg aca gag tat gtg gtc act cgt tgg tac cgg gca cca 624
Thr Asp Leu Met Thr Glu Tyr Val Val Thr Arg Trp Tyr Arg Ala Pro
195 200 205

gag ctg ctg ttg aac tgt tca cag tat act gct gcc att gat gtc tgg 672
Glu Leu Leu Leu Asn Cys Ser Gln Tyr Thr Ala Ala Ile Asp Val Trp
210 215 220

tca gtt gga tgc ata cta ggt gaa atc gtt act cgt caa ccc ctg ttt 720
Ser Val Gly Cys Ile Leu Gly Glu Ile Val Thr Arg Gln Pro Leu Phe
225 230 235 240

cct gga cgg gat tac atc cag caa tta aaa ttg atc act gag ctc ata 768
Pro Gly Arg Asp Tyr Ile Gln Gln Leu Lys Leu Ile Thr Glu Leu Ile
245 250 255

ggc tct cca gat gat gca agc ctg gga ttt ctt cga agt gat aat gca 816
Gly Ser Pro Asp Asp Ala Ser Leu Gly Phe Leu Arg Ser Asp Asn Ala
260 265 270

aaa aga tac atg aaa caa cta cca cag ttt cca aga cag gac ttc cgc 864
Lys Arg Tyr Met Lys Gln Leu Pro Gln Phe Pro Arg Gln Asp Phe Arg
275 280 285

ctg cgt ttc cgc aac atg tct cct ggc gca gtc gat ttg ttg gaa agg 912
Leu Arg Phe Arg Asn Met Ser Pro Gly Ala Val Asp Leu Leu Glu Arg
290 295 300

atg ctt gtg ttt gat cca agc aga cgg att aca gtt gat gag gct ctg 960
Met Leu Val Phe Asp Pro Ser Arg Arg Ile Thr Val Asp Glu Ala Leu
305 310 315 320

cat cat cca tac ttg gct tca ctt cat gag atc aat gat gaa cct acc 1008
His His Pro Tyr Leu Ala Ser Leu His Glu Ile Asn Asp Glu Pro Thr
325 330 335

tgc cct gca cct ttc agc ttt gar ttr gag caa cca tcc ttt aca gaa 1056
Cys Pro Ala Pro Phe Ser Phe Asp Phe Glu Gln Pro Ser Phe Thr Glu
340 345 350

gcg cat ata aaa gaa ctc atc tgg agg gaa tct tta gca ttt aac cca 1104
Ala His Ile Lys Glu Leu Ile Trp Arg Glu Ser Leu Ala Phe Asn Pro
355 360 365

gag cct ccc tac taa 1119 
Glu Pro Pro Tyr 
370 



<210> 18 

<211> 372 

<212> PRT 

<213> Zea mays

<400> 18
Met Asp Ser Ser Gly Gly Gly Gly Gly Ala Gln Ile Lys Gly Met Ala
1 5 10 15

Thr His Gly Gly Arg Tyr Val Leu Tyr Asn Val Tyr Gly Asn Leu Phe
20 25 30

Glu Val Ser Ser Lys Tyr Ala Pro Pro Ile Arg Pro Ile Gly Arg Gly
35 40 45

Ala Tyr Gly Ile Val Cys Ala Ala Val Asn Ser Gln Ser Gly Glu Glu
50 55 60

Val Ala Ile Lys Lys Val Gly Asn Ala Phe Asp Asn His Ile Asp Ala
65 70 75 80

Lys Arg Thr Leu Arg Glu Ile Lys Leu Leu Arg His Met Asp His Glu
85 90 95

Asn Ile Leu Ala Leu Lys Asp Val Ile Arg Pro Pro Thr Arg Glu Asn
100 105 110

Phe Asn Asp Val Tyr Ile Val Thr Glu Leu Met Asp Thr Asp Leu His
115 120 125

Gln Ile Val Arg Ser Asn Gln Pro Leu Thr Asp Asp His Cys Gln Tyr
130 135 140

Phe Leu Tyr Gln Leu Leu Arg Gly Leu Lys Tyr Val His Ser Ala Asn
145 150 155 160

Ile Leu His Arg Asp Leu Lys Pro Ser Asn Leu Phe Leu Asn Ala Asn
165 170 175

Cys Asp Leu Lys Ile Ala Asp Phe Gly Leu Ala Arg Thr Thr Ser Glu
180 185 190

Thr Asp Leu Met Thr Glu Tyr Val Val Thr Arg Trp Tyr Arg Ala Pro
195 200 205

Glu Leu Leu Leu Asn Cys Ser Gln Tyr Thr Ala Ala Ile Asp Val Trp
210 215 220

Ser Val Gly Cys Ile Leu Gly Glu Ile Val Thr Arg Gln Pro Leu Phe
225 230 235 240

Pro Gly Arg Asp Tyr Ile Gln Gln Leu Lys Leu Ile Thr Glu Leu Ile
245 250 255

Gly Ser Pro Asp Asp Ala Ser Leu Gly Phe Leu Arg Ser Asp Asn Ala
260 265 270

Lys Arg Tyr Met Lys Gln Leu Pro Gln Phe Pro Arg Gln Asp Phe Arg
275 280 285

Leu Arg Phe Arg Asn Met Ser Pro Gly Ala Val Asp Leu Leu Glu Arg
290 295 300

Met Leu Val Phe Asp Pro Ser Arg Arg Ile Thr Val Asp Glu Ala Leu
305 310 315 320

His His Pro Tyr Leu Ala Ser Leu His Glu Ile Asn Asp Glu Pro Thr
325 330 335

Cys Pro Ala Pro Phe Ser Phe Asp Phe Glu Gln Pro Ser Phe Thr Glu
340 345 350

Ala His Ile Lys Glu Leu Ile Trp Arg Glu Ser Leu Ala Phe Asn Pro
355 360 365

Glu Pro Pro Tyr
370



<210> 19
<211> 20
<212> DNA
<213> Zea mays

<400> 19
atggattcct ccggcggcgg 20

<210> 20
<211> 20
<212> DNA
<213> Zea mays

<400> 20
ttagtaggga ggctctgggt 20

<210> 21
<211> 1113
<212> DNA
<213> Zea mays

<220>
<221> CDS
<222> (1)...(1110)

<400> 21
atg gcg atg atg gtg gat ... 48
Met Ala Met Met Val Asp ...
1 5 10 15

cat tac tac tct atg tgg cag acc tta ntt gag ata gac acc aaa tat 96
His Tyr Tyr Ser Met Trp Gln Thr Leu Phe Glu Ile Asp Thr Lys Tyr
20 25 30

gtg ccg atc aag ccc att ggt cga gga gct tat ggg ata gtt tgt tca 144
Val Pro Ile Lys Pro Ile Gly Arg Gly Ala Tyr Gly Ile Val Cys Ser
35 40 45

tcc att ajL cat gaa aca aat gag aaa gta gca ata aag aag ata cac 192
Ser Ile Asn Arg Glu Thr Asn Glu Lys Val Ala Ile Lys Lys Ile His
50 55 60

aac gtt ttc gac aac cgt gtg gac gca cta cgg acc ttg cgg gag ctg 240
Asn Val Phe Asp Asn Arg Val Asp Ala Leu Arg Thr Leu Arg Glu Leu
65 70 75 80

aaa ctc ctt cgc cat ctc cgg cat gag aat gtc att gct ttg aag gat 288
Lys Leu Leu Arg His Leu Arg His Glu Asn Val Ile Ala Leu Lys Asp
85 90 95

ata atg acg cca ata cac agg aga agc ttt aag gat gtg tac ttg gta 336
Ile Met Met Pro Ile His Arg Arg Ser Phe Lys Asp Val Tyr Leu Val
100 105 110

tac gaa ctc atg gat act gat ttg cac cag ata atc aaa tca cct cag 384
Tyr Glu Leu Met Asp Thr Asp Leu His Gln Ile Ile Lys Ser Pro Gln
115 120 125

ggc ctt tcc aat gac cac tgc cag tat ttt ctt ttt cag ttg ctc cga 432
Gly Leu Ser Asn Asp His Cys Gln Tyr Phe Leu Phe Gln Leu Leu Arg
130 135 140

gga ctc aaa tat ctc cat tca gca gaa ata ctc cac aga gac cta aaa 480
Gly Leu Lys Tyr Leu His Ser Ala Glu Ile Leu His Arg Asp Leu Lys
145 150 155 160

cct gga aac ctg ctg gtg aat gca aat tgt gat ctg aag ata tgt gat 528
Pro Gly Asn Leu Leu Val Asn Ala Asn Cys Asp Leu Lys Ile Cys Asp
165 170 175

ttt ggt ctc gca cgt aca aac agt agc aaa ggc cag ttc atg act gaa 576
Phe Gly Leu Ala Arg Thr Asn Ser Ser Lys Gly Gln Phe Met Thr Glu
180 185 190

tac gtc gtc acc cgc tgg tac aga gct cct gag ctg ctc ctc tgc tgc 624
Tyr Val Val Thr Arg Trp Tyr Arg Ala Pro Glu Leu Leu Leu Cys Cys
195 200 205

gac aac tac ggc aca tcc ata gac gtc tgg tct gtt ggg tgc atc ttt 672
Asp Asn Tyr Gly Thr Ser Ile Asp Val Trp Ser Val Gly Cys Ile Phe
210 215 220

gct gag ctc ctt ggc cgc aag cca ata ttt cca gga act gaa tgc ctg 720
Ala Glu Leu Leu Gly Arg Lys Pro Ile Phe Pro Gly Thr Glu Cys Leu
225 230 235 240

aat caa ctc aag ctc ata gtg aac gtc ctc ggc aca atg agt gag gct 768
Asn Gln Leu Lys Leu Ile Val Asn Val Leu Gly Thr Met Ser Glu Ala
245 250 255

gac cta gag ttc atc gac aac cca aag gct cgg aga tac atc aag tcc 816
Asp Leu Glu Phe Ile Asp Asn Pro Lys Ala Arg Arg Tyr Ile Lys Ser
260 265 270

ctt ccc tat acc cct ggt gtt cct ctc gta agt atg tac cca cat gcg 864
Leu Pro Tyr Thr Pro Gly Val Pro Leu Val Ser Met Tyr Pro His Ala
275 280 285

cac cct ctt gcc att gat ctg ttg cag aag atg ctc atc ttc gac ccc 912
His Pro Leu Ala Ile Asp Leu Leu Gln Lys Met Leu Ile Phe Asp Pro
290 295 300

acc aaa agg atc agt gtc acc gag gct ctc gag cac cct tac atg tcc 960
Thr Lys Arg Ile Ser Val Thr Glu Ala Leu Glu His Pro Tyr Met Ser
305 310 315 320

cct ctg tat gac cca agc gca aat ccc cca gcc caa gtg ccc atc gat 1008
Pro Leu Tyr Asp Pro Ser Ala Asn Pro Pro Ala Gln Val Pro Ile Asp
325 330 335

ctg gac ata gac gaa aac atc agc tca gag atg atc cgg gaa atg atg 1056
Leu Asp Ile Asp Glu Asn Ile Ser Ser Glu Met Ile Arg Glu Met Met
340 345 350

tgg cag gag atg ctt cac tac cac cct gaa gtt gcc aca gca ata agc 1104
Trp Gln Glu Met Leu His Tyr His Pro Glu Val Ala Thr Ala Ile Ser
355 360 365

atg tca tga 1113
Met Ser
370



<210> 22 
<211> 370 
<212> PRT 
<213> Zea mays 



<400> 22 



Met Ala Met Met Val Asp Pro Pro Asn Gly Ile Gly Asn Gln Gly Lys
1 5 10 15

His Tyr Tyr Ser Met Trp

310 315 320

Pro Leu Tyr Asp Pro Ser Ala Asn Pro Pro Ala Gln Val Pro Ile Asp
325 330 335

Leu Asp Ile Asp Glu Asn Ile
Glu
Ile

<210> 23
<211> 20
<212> DNA 
<213> Zea mays 

<400> 23 

atggcgatga tggtggatcc 20 

<210> 24 
<211> 20 
<212> DNA 
<213> Zea mays

<400> 24 

tcatgacatg cttattgctg 20 

<210> 25 
<211> 1125 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (1)...(1122)

<400> 25 

atg agc gga gga ggc gtg gac gga gct ccg gtc gcc gag ttc cgg cag 48
Met Ser Gly Gly Gly Val Asp Gly Ala Pro Val Ala Glu Phe Arg Gln
1 5 10 15

acg gtg acg cac ggc ggc cgg ttc ctg cag tac aac atc ttc ggc aac 96
Thr Val Thr His Gly Gly Arg Phe Leu Gln Tyr Asn Ile Phe Gly Asn
20 25 30

ctg ttc gag atc acg cac aag tac cag ccc ccc atc atg ccc atc ggc 144
Leu Phe Glu Ile Thr His Lys Tyr Gln Pro Pro Ile Met Pro Ile Gly
35 40 45

cgc ggc gcc tac ggg atc gtc tgc tcg gtg atg aac tcc gag acg aag 192
Arg Gly Ala Tyr Gly Ile Val Cys Ser Val Met Asn Ser Glu Thr Lys
50 55 60

gag atg gtg gcc atc aag aag atc gcc aac gcc ttc gac aac cac atg 240
Glu Met Val Ala Ile Lys Lys Ile Ala Asn Ala Phe Asp Asn His Met
65 70 75 80

gac gcc aag cgc acg ctc cgg gag atc aag ctg ctg agg cac ctc gac 288
Asp Ala Lys Arg Thr Leu Arg Glu Ile Lys Leu Leu Arg His Leu Asp
85 90 95

cac gag aac atc atc ggc atc agg gac gtg atc ccg ccg ccc gtc ccg 336
His Glu Asn Ile Ile Gly Ile Arg Asp Val Ile Pro Pro Pro Val Pro
100 105 110

caa gcg ttc aac gac gtg tac atc ggg acg gag ctg atg gac acg gac 384
Gln Ala Phe Asn Asp Val Tyr Ile Gly Thr Glu Leu Met Asp Thr Asp
115 120 125

ctg cac cac atc atc cgg tcc aac cag gag ctc tcg gag gag cac tcc 432
Leu His His Ile Ile Arg Ser Asn Gln Glu Leu Ser Glu Glu His Ser
130 135 140

cag tac ttc atg tac cag atc ctc cgc ggg ctc aag tac atc cac tcc 480
Gln Tyr Phe Met Tyr Gln Ile Leu Arg Gly Leu Lys Tyr Ile His Ser
145 150 155 160

gcc aac gtg atc cac cgc gac ctc aag ccc agc aac ctg ctg gtg aac 528
Ala Asn Val Ile His Arg Asp Leu Lys Pro Ser Asn Leu Leu Val Asn
165 170 175

gcc aac tgc gac ctc aag acc tgc gac ttc gag ctg gcg cgc ccg tcc 576
Ala Asn Cys Asp Leu Lys Ile Cys Asp Phe Gly Leu Ala Arg Pro Ser
180 185 190

tcc gag agc gac atg atg acg gag tac gtg gtc acg cgc tgg tac cgc 624
Ser Glu Ser Asp Met Met Thr Glu Tyr Val Val Thr Arg Trp Tyr Arg
195 200 205

gcg ccc gag ctg ctg ctc aac tcc acc gac tac tcg gcg gcc atc gac 672
Ala Pro Glu Leu Leu Leu Asn Ser Thr Asp Tyr Ser Ala Ala Ile Asp
210 215 220

gtc tgg tcc gtc ggc tgc atc ttc atg gag ctc atc aac cgc cag ccg 720
Val Trp Ser Val Gly Cys Ile Phe Met Glu Leu Ile Asn Arg Gln Pro
225 230 235 240

ctc ttc ccc gga cgc gac cac atg cac cag atg cgc ctc ata acc gag 768
Leu Phe Pro Gly Arg Asp His Met His Gln Met Arg Leu Ile Thr Glu
245 250 255

gtg atc ggg acg ccc acg gac gat gag ctc ggg ttc atc cgg aac gag 816
Val Ile Gly Thr Pro Thr Asp Asp Glu Leu Gly Phe Ile Arg Asn Glu
260 265 270

gac gcg cgg aag tac atg cgc cac ctc ccg cag ttc ccg cgc cgg ccg 864
Asp Ala Arg Lys Tyr Met Arg His Leu Pro Gln Phe Pro Arg Arg Pro
275 280 285

ttc gtc agc ctg ttc ccg cgg atg cag ccc gtc gcg ctg gac ctc atc 912
Phe Val Ser Leu Phe Pro Arg Met Gln Pro Val Ala Leu Asp Leu Ile
290 295 300

gag cgg atg ctc acc ttc aac ccg ctg cag agg atc aca gtg gaa gag 960
Glu Arg Met Leu Thr Phe Asn Pro Leu Gln Arg Ile Thr Val Glu Glu
305 310 315 320

gcg ctg gag cac ccg tac ctg gaa cgg cta cac gac gtc gcc gac gag 1008
Ala Leu Glu His Pro Tyr Leu Glu Arg Leu His Asp Val Ala Asp Glu
325 330 335

ccc atc tgc acg gac ccg ttc tcg ttc gac ttc gag cag cag gct ctg 1056
Pro Ile Cys Thr Asp Pro Phe Ser Phe Asp Phe Glu Gln Gln Ala Leu
340 345 350

acg gaa gac caa atg aag cag ctg ata ttc aac gag gcc atg gaa ctc 1104
Thr Glu Asp Gln Met Lys Gln Leu Ile Phe Asn Glu Ala Met Glu Leu
355 360 365

aac ccc aac ttc cga tac tag 1125 
Asn Pro Asn Phe Arg Tyr 
370 



<210> 26 
<211> 374 
<212> PRT 
<213> Zea mays 

<400> 26 

Met Ser Gly Gly Gly Val Asp Gly Ala Pro Val Ala Glu Phe Arg Gln
1 5 10 15

Thr Val Thr His Gly Gly Arg Phe Leu Gln Tyr Asn Ile Phe Gly Asn
20 25 30

Leu Phe Glu Ile Thr His Lys Tyr Gln Pro Pro Ile Met Pro Ile Gly
35 40 45

Arg Gly Ala Tyr Gly Ile Val Cys Ser Val Met Asn Ser Glu Thr Lys
50 55 60

Glu Met Val Ala Ile Lys Lys Ile Ala Asn Ala Phe Asp Asn His Met
65 70 75 80

Asp Ala Lys Arg Thr Leu Arg Glu Ile Lys Leu Leu Arg His Leu Asp
85 90 95

His Glu Asn Ile Ile Gly Ile Arg Asp Val Ile Pro Pro Pro Val Pro
100 105 110

Gln Ala Phe Asn Asp Val Tyr Ile Gly Thr Glu Leu Met Asp Thr Asp
115 120 125

Leu His His Ile Ile Arg Ser Asn Gln Glu Leu Ser Glu Glu His Ser
130 135 140

Gln Tyr Phe Met Tyr Gln Ile Leu Arg Gly Leu Lys Tyr Ile His Ser
145 150 155 160

Ala Asn Val Ile His Arg Asp Leu Lys Pro Ser Asn Leu Leu Val Asn
165 170 175

Ala Asn Cys Asp Leu Lys Ile Cys Asp Phe Gly Leu Ala Arg Pro Ser
180 185 190

Ser Glu Ser Asp Met Met Thr Glu Tyr Val Val Thr Arg Trp Tyr Arg
195 200 205

Ala Pro Glu Leu Leu Leu Asn Ser Thr Asp Tyr Ser Ala Ala Ile Asp
210 215 220

Val Trp Ser Val Gly Cys Ile Phe Met Glu Leu Ile Asn Arg Gln Pro
225 230 235 240

Leu Phe Pro Gly Arg Asp His Met His Gln Met Arg Leu Ile Thr Glu
245 250 255

Val Ile Gly Thr Pro Thr Asp Asp Glu Leu Gly Phe Ile Arg Asn Glu
260 265 270

Asp Ala Arg Lys Tyr Met Arg His Leu Pro Gln Phe Pro Arg Arg Pro
275 280 285

Phe Val Ser Leu Phe Pro Arg Met Gln Pro Val Ala Leu Asp Leu Ile
290 295 300

Glu Arg Met Leu Thr Phe Asn Pro Leu Gln Arg Ile Thr Val Glu Glu
305 310 315 320

Ala Leu Glu His Pro Tyr Leu Glu Arg Leu His Asp Val Ala Asp Glu
325 330 335

Pro Ile Cys Thr Asp Pro Phe Ser Phe Asp Phe Glu Gln Gln Ala Leu
340 345 350

Thr Glu Asp Gln Met Lys Gln Leu Ile Phe Asn Glu Ala Met Glu Leu

355 360 365 

Asn Pro Asn Phe Arg Tyr 
370 

<210> 27 
<211> 20
<212> DNA 
<213> Zea mays 

<400> 27 

atgagcggag gaggcgtgga 20 

<210> 28
<211> 20
<212> DNA
<213> Zea mays

<400> 28
ctagtatcgg aagttggggt 20

<210> 29
<211> 1197
<212> DNA
<213> Zea mays

<220>
<221> CDS
<222> (1)...(1194)

<400> 29
atg gac ggc ggg ggg cag ccc ccg gac acc gag atg tcg gag gcc ggc 48
Met Asp Gly Gly Gly Gln Pro Pro Asp Thr Glu Met Ser Glu Ala Gly
1 5 10 15

gcg ggc ggc gga ggg cag ccg ccg cag cag cca ctg ccg ccg gtg ggc 96
Ala Gly Gly Gly Gly Gln Pro Pro Gln Gln Pro Leu Pro Pro Val Gly
20 25 30

ggg ggc gtg atg ttg gac aac atc cag gcg acg ctc agc cac ggc ggc 144
Gly Gly Val Met Leu Asp Asn Ile Gln Ala Thr Leu Ser His Gly Gly
35 40 45

cgc ttc atc cag tac aac atc ttc ggc aac gtg ttc gag gtc acc gcc 192
Arg Phe Ile Gln Tyr Asn Ile Phe Gly Asn Val Phe Glu Val Thr Ala
50 55 60

aag tac aag ccc ccc gtc ctc ccc atc ggc aag ggc gcc tac ggc atc 240
Lys Tyr Lys Pro Pro Val Leu Pro Ile Gly Lys Gly Ala Tyr Gly Ile
65 70 75 80

gtc tgc tcg gcg ctc aac tcc gag acg gcg gag cag gtg gcc atc aag 288
Val Cys Ser Ala Leu Asn Ser Glu Thr Ala Glu Gln Val Ala Ile Lys
85 90 95

aag atc gcc aac gcc ttc gac aac aag atc gat gcc aag cgc acg ctc 336
Lys Ile Ala Asn Ala Phe Asp Asn Lys Ile Asp Ala Lys Arg Thr Leu
100 105 110

cgc gag atc aag ctg ctc cgc cac atg gac cac gag aat att gtt gca 384
Arg Glu Ile Lys Leu Leu Arg His Met Asp His Glu Asn Ile Val Ala
115 120 125

ata agg gga atc ata cct cct gcg cag agg gct gca ttc aat gat gtg 432
Ile Arg Gly Ile Ile Pro Pro Ala Gln Arg Ala Ala Phe Asn Asp Val
130 135 140

tat att gca tat gaa ttg atg gat act gat ctg cat caa att att cgt 480
Tyr Ile Ala Tyr Glu Leu Met Asp Thr Asp Leu His Gln Ile Ile Arg
145 150 155 160

tca aat caa gct ttg tca gag gag cac tgt cag tat ttt ctt tat caa 528
Ser Asn Gln Ala Leu Ser Glu Glu His Cys Gln Tyr Phe Leu Tyr Gln
165 170 175

att ctt cgc ggc ttg aag tat ata cat tca gca aat gtt ctt cac cgt 576
Ile Leu Arg Gly Leu Lys Tyr Ile His Ser Ala Asn Val Leu His Arg
180 185 190

gac ttg aag cct agc aat ctt ctr ttg aat gca aac tgt gac ctc aag 624
Asp Leu Lys Pro Ser Asn Leu Leu Leu Asn Ala Asn Cys Asp Leu Lys
195 200 205

ata tgt gat ttt ggg ctt gct cgc acc acc tca gaa act gat ttt atg 672
Ile Cys Asp Phe Gly Leu Ala Arg Thr Thr Ser Glu Thr Asp Phe Met
210 215 220

act gaa tat gtt gtc aca aga tgg tat aga gca cca gag ctt ttg ttg 720
Thr Glu Tyr Val Val Thr Arg Trp Tyr Arg Ala Pro Glu Leu Leu Leu
225 230 235 240

aac tcc tca gaa tat act gct gcc atc gat gtg tgg tct gtg ggc tgt 768
Asn Ser Ser Glu Tyr Thr Ala Ala Ile Asp Val Trp Ser Val Gly Cys
245 250 255

ata ttt atg gaa ctg atg gac cgc aaa ccc ttg ttt cct gga aga gac 816
Ile Phe Met Glu Leu Met Asp Arg Lys Pro Leu Phe Pro Gly Arg Asp
260 265 270

cat gtc cat cag cta cgt ctt cta atg gag ctc att ggc acg cca aat 864
His Val His Gln Leu Arg Leu Leu Met Glu Leu Ile Gly Thr Pro Asn
275 280 285

gag ggt gat ctt gat ttt gta aat gaa aat gca aga aga tat atc cgc 912
Glu Gly Asp Leu Asp Phe Val Asn Glu Asn Ala Arg Arg Tyr Ile Arg
290 295 300

caa ctt cct cgt cac cct aga cag tcc tta cct gaa aaa ttt cca cat 960
Gln Leu Pro Arg His Pro Arg Gln Ser Leu Pro Glu Lys Phe Pro His
305 310 315 320

gta caa ccc tta gca att gac ctg gtg gaa aag atg ctg act ttt gat 1008
Val Gln Pro Leu Ala Ile Asp Leu Val Glu Lys Met Leu Thr Phe Asp
325 330 335

cct aga cag aga ata act gtt gaa ggc gca ctt gca cac cct tac ttg 1056
Pro Arg Gln Arg Ile Thr Val Glu Gly Ala Leu Ala His Pro Tyr Leu
340 345 350

gca cca ctt cat gac ata agt gat gag cca ggc tgc tca atg ccc ttc 1104
Ala Ser Leu His Asp Ile Ser Asp Glu Pro Gly Cys Ser Met Pro Phe
355 360 365

agc ttt gac ttc gag cag cat gca ttg tct gaa gaa caa atg aag gat 1152
Ser Phe Asp Phe Glu Gln His Ala Leu Ser Glu Glu Gln Met Lys Asp
370 375 380

ctt atc tac caa gag gct ctt gca ttc aac cct gat tat cag 1194
Leu Ile Tyr Gln Glu Ala Leu Ala Phe Asn Pro Asp Tyr Gln
385 390 395

tag 1197 

<210> 30 

<211> 398 

<212> PRT 

<213> Zea mays 

<400> 30
Met Asp Gly Gly Gly Gln Pro Pro Asp Thr Glu Met Ser Glu Ala Gly
1 5 10 15

Ala Gly Gly Gly Gly Gln Pro Pro Gln Gln Pro Leu Pro Pro Val Gly
20 25 30

Gly Gly Val Met Leu Asp Asn Ile Gln Ala Thr Leu Ser His Gly Gly
35 40 45

Arg Phe Ile Gln Tyr Asn Ile Phe Gly Asn Val Phe Glu Val Thr Ala
50 55 60

Lys Tyr Lys Pro Pro Val Leu Pro Ile Gly Lys Gly Ala Tyr Gly Ile
65 70 75 80

Val Cys Ser Ala Leu Asn Ser Glu Thr Ala Glu Gln Val Ala Ile Lys
85 90 95

Lys Ile Ala Asn Ala Phe Asp Asn Lys Ile Asp Ala Lys Arg Thr Leu
100 105 110

Arg Glu Ile Lys Leu Leu Arg His Met Asp His Glu Asn Ile Val Ala
115 120 125

Ile Arg Gly Ile Ile Pro Pro Ala Gln Arg Ala Ala Phe Asn Asp Val
130 135 140

Tyr Ile Ala Tyr Glu Leu Met Asp Thr Asp Leu His Gln Ile Ile Arg
145 150 155 160

Ser Asn Gln Ala Leu Ser Glu Glu His Cys Gln Tyr Phe Leu Tyr Gln
165 170 175

Ile Leu Arg Gly Leu Lys Tyr Ile His Ser Ala Asn Val Leu His Arg
180 185 190

Asp Leu Lys Pro Ser Asn Leu Leu Leu Asn Ala Asn Cys Asp Leu Lys
195 200 205

Ile Cys Asp Phe Gly Leu Ala Arg Thr Thr Ser Glu Thr Asp Phe Met
210 215 220

Thr Glu Tyr Val Val Thr Arg Trp Tyr Arg Ala Pro Glu Leu Leu Leu
225 230 235 240

Asn Ser Ser Glu Tyr Thr Ala Ala Ile Asp Val Trp Ser Val Gly Cys
245 250 255

Ile Phe Met Glu Leu Met Asp Arg Lys Pro Leu Phe Pro Gly Arg Asp
260 265 270

His Val His Gln Leu Arg Leu Leu Met Glu Leu Ile Gly Thr Pro Asn
275 280 285

Glu Gly Asp Leu Asp Phe Val Asn Glu Asn Ala Arg Arg Tyr Ile Arg
290 295 300

Gln Leu Pro Arg His Pro Arg Gln Ser Leu Pro Glu Lys Phe Pro His
305 310 315 320

Val Gln Pro Leu Ala Ile Asp Leu Val Glu Lys Met Leu Thr Phe Asp
325 330 335

Pro Arg Gln Arg Ile Thr Val Glu Gly Ala Leu Ala His Pro Tyr Leu
340 345 350

Ala Ser Leu His Asp Ile Ser Asp Glu Pro Gly Cys Ser Met Pro Phe
355 360 365

Ser Phe Asp Phe Glu Gln His Ala Leu Ser Glu Glu Gln Met Lys Asp
370 375 380

Leu Ile Tyr Gln Glu Ala Leu Ala Phe Asn Pro Asp Tyr Gln
385 390 395



<210> 31 
<211> 20 
<212> DNA 
<213> Zea mays 

<400> 31 

atggacggcg gggggcagcc 20 



<210> 32 
<211> 20 
<212> DNA
<213> Zea mays

<400> 32 

ctactgataa tcagggttga 20 

<210> 33 
<211> 1197 
<212> DNA 
<213> Zea mays

<220>
<221> CDS
<222> (1)...(1194)

<400> 33
atg gac ggc ggg ggg cag ccc ccg gac acg gag atg aca gac gcc ggc 48
Met Asp Gly Gly Gly Gln Pro Pro Asp Thr Glu Met Thr Asp Ala Gly
1 5 10 15

ttg ggc ggc ggg ggg caa ccg ccg ccg ccg ccg caa cag ccg gcg ggc 96
Leu Gly Gly Gly Gly Gln Pro Pro Pro Pro Pro Gln Gln Pro Ala Gly
20 25 30

ggg gcc ggg atg atg gag aac atc cac gcg acg ctc agc cac ggt ggc 144
Gly Ala Gly Met Met Glu Asn Ile His Ala Thr Leu Ser His Gly Gly
35 40 45

cgc ttc atc cag tac aac atc ttc ggc aac gtg ttc gag gtc acc tcc 192
Arg Phe Ile Gln Tyr Asn Ile Phe Gly Asn Val Phe Glu Val Thr Ser
50 55 60

aag tac aag ccc ccc atc ctc ccc atc ggc aag ggc gcc tac ggc atc 240
Lys Tyr Lys Pro Pro Ile Leu Pro Ile Gly Lys Gly Ala Tyr Gly Ile
65 70 75 80

gtc tgc tcg gcg ctc aac tcc gag acg gca gag cag gtg gcc atc aag 288
Val Cys Ser Ala Leu Asn Ser Glu Thr Ala Glu Gln Val Ala Ile Lys
85 90 95

aag atc gcc aac gcc ttc gac aac aag atc gat gcc aag cgc acg ctc 336
Lys Ile Ala Asn Ala Phe Asp Asn Lys Ile Asp Ala Lys Arg Thr Leu
100 105 110

cgc gag atc aag ctg ctc cgc cac atg gac cac gag aat att gtt gca 384
Arg Glu Ile Lys Leu Leu Arg His Met Asp His Glu Asn Ile Val Ala
115 120 125

ata aga gat atc ata cct cct cca ttg agg gag gca ttc aat gat gtg 432
Ile Arg Asp Ile Ile Pro Pro Pro Leu Arg Glu Ala Phe Asn Asp Val
130 135 140

tat att gcc tat gaa ttg atg gat act gat ctg cat caa att att cgt 480
Tyr Ile Ala Tyr Glu Leu Met Asp Thr Asp Leu His Gln Ile Ile Arg
145 150 155 160

tca aat caa gct ttg tca gag gag cac tgt cag tat ttt ctt tat caa 528
Ser Asn Gln Ala Leu Ser Glu Glu His Cys Gln Tyr Phe Leu Tyr Gln
165 170 175

att ctt cgt ggc ttg aag tat ata cat tca gca aat gtc ctt cac cgt 576
Ile Leu Arg Gly Leu Lys Tyr Ile His Ser Ala Asn Val Leu His Arg
180 185 190

gac ttg aag cct agc aat ctt ctt ttg aat gca aac tgt gac ctc aag 624
Asp Leu Lys Pro Ser Asn Leu Leu Leu Asn Ala Asn Cys Asp Leu Lys
195 200 205

ata tgt gat ttt ggg ctt gct cgc acc acc tca gaa act gat ttt atg 672
Ile Cys Asp Phe Gly Leu Ala Arg Thr Thr Ser Glu Thr Asp Phe Met
210 215 220

act gaa tat gtt gtc aca aga tgg tat aga gca cca gag ctt tta ttg 720
Thr Glu Tyr Val Val Thr Arg Trp Tyr Arg Ala Pro Glu Leu Leu Leu
225 230 235 240

aac tcc tct gaa tat act gct gcc att gat gtg tgg tct gtg ggc tgt 768
Asn Ser Ser Glu Tyr Thr Ala Ala Ile Asp Val Trp Ser Val Gly Cys
245 250 255

ata ttt atg gaa ctg atg gac cga aaa ccc ttg ttt cct gga aga gat 816
Ile Phe Met Glu Leu Met Asp Arg Lys Pro Leu Phe Pro Gly Arg Asp
260 265 270

cat gtc cat cag cta cgt cta cta atg gag ctc att gga aca ccg aat 864
His Val His Gln Leu Arg Leu Leu Met Glu Leu Ile Gly Thr Pro Asn
275 280 285

gag gct gat ctt gat ttt gta aat gaa aat gca aga aga tat atc cgc 912
Glu Ala Asp Leu Asp Phe Val Asn Glu Asn Ala Arg Arg Tyr Ile Arg
290 295 300

caa ctt ccc tgt cat gct aga cag tcc ttc cct gaa aaa ttt cca cat 960
Gln Leu Pro Cys His Ala Arg Gln Ser Phe Pro Glu Lys Phe Pro His
305 310 315 320

gta caa cct tta gca att gac cta gtg gaa aag atg cta act ttt gat 1008
Val Gln Pro Leu Ala Ile Asp Leu Val Glu Lys Met Leu Thr Phe Asp
325 330 335

cct aga cag aga aca act gtt gaa ggc gca ctt gca cac cct tac ttg 1056
Pro Arg Gln Arg Ile Thr Val Glu Gly Ala Leu Ala His Pro Tyr Leu
340 345 350

gca tca ctt cat gac ata agt gat gag cca gtc tgc tca atg ccc ttc 1104
Ala Ser Leu His Asp Ile Ser Asp Glu Pro Val Cys Ser Met Pro Phe
355 360 365

agc ttc aac ttc gag cag cat gca tta tct gaa gaa cag atg aag gat 1152
Ser Phe Asp Phe Glu Gln His Ala Leu Ser Glu Glu Gln Met Lys Asp
370 375 380

ctg acc tac caa gag gct ctt gca ttc aac cca gat tac cag 1194
Leu Ile Tyr Gln Glu Ala Leu Ala Phe Asn Pro Asp Tyr Gln
385 390 395

tag 1197 

<210> 34 

<211> 398 

<212> PRT 

<213> Zea mays 

<400> 34 
Met Asp Gly Gly Gly Gln Pro Pro Asp Thr Glu Met Thr Asp Ala Gly 
1 5 10 15 

Leu Gly Gly Gly Gly Gln Pro Pro Pro Pro Pro Gln Gln Pro Ala Gly 
20 25 30 

Gly Ala Gly Met Met Glu Asn Ile His Ala Thr Leu Ser His Gly Gly 
35 40 45 

Arg Phe Ile Gln Tyr Asn Ile Phe Gly Asn Val Phe Glu Val Thr Ser 
50 55 60 

Lys Tyr Lys Pro Pro Ile Leu Pro Ile Gly Lys Gly Ala Tyr Gly Ile 
65 70 75 80 

Val Cys Ser Ala Leu Asn Ser Glu Thr Ala Glu Gln Val Ala Ile Lys 
85 90 95 

Lys Ile Ala Asn Ala Phe Asp Asn Lys Ile Asp Ala Lys Arg Thr Leu 
100 105 110 

Arg Glu Ile Lys Leu Leu Arg His Met Asp His Glu Asn Ile Val Ala 
115 120 125 

Ile Arg Asp Ile Ile Pro Pro Pro Leu Arg Glu Ala Phe Asn Asp Val 
130 135 140 

Tyr Ile Ala Tyr Glu Leu Met Asp Thr Asp Leu His Gln Ile Ile Arg 
145 150 155 160 

Ser Asn Gln Ala Leu Ser Glu Glu His Cys Gln Tyr Phe Leu Tyr Gln 
165 170 175 

Ile Leu Arg Gly Leu Lys Tyr Ile His Ser Ala Asn Val Leu His Arg 
180 185 190 

Asp Leu Lys Pro Ser Asn Leu Leu Leu Asn Ala Asn Cys Asp Leu Lys 
195 200 205 

Ile Cys Asp Phe Gly Leu Ala Arg Thr Thr Ser Glu Thr Asp Phe Met 
210 215 220 

Thr Glu Tyr Val Val Thr Arg Trp Tyr Arg Ala Pro Glu Leu Leu Leu 
225 230 235 240 

Asn Ser Ser Glu Tyr Thr Ala Ala Ile Asp Val Trp Ser Val Gly Cys 
245 250 255 

Ile Phe Met Glu Leu Met Asp Arg Lys Pro Leu Phe Pro Gly Arg Asp 
260 265 270 

His Val His Gln Leu Arg Leu Leu Met Glu Leu Ile Gly Thr Pro Asn 
275 280 285 

Glu Ala Asp Leu Asp Phe Val Asn Glu Asn Ala Arg Arg Tyr Ile Arg 
290 295 300 

Gln Leu Pro Cys His Ala Arg Gln Ser Phe Pro Glu Lys Phe Pro His 
305 310 315 320 

Val Gln Pro Leu Ala Ile Asp Leu Val Glu Lys Met Leu Thr Phe Asp 
325 330 335 

Pro Arg Gln Arg Ile Thr Val Glu Gly Ala Leu Ala His Pro Tyr Leu 
340 345 350 

Ala Ser Leu His Asp Ile Ser Asp Glu Pro Val Cys Ser Met Pro Phe 
355 360 365 

Ser Phe Asp Phe Glu Gln His Ala Leu Ser Glu Glu Gln Met Lys Asp 
370 375 380 

Leu Ile Tyr Gln Glu Ala Leu Ala Phe Asn Pro Asp Tyr Gln 
385 390 395 



<210> 35 
<211> 20 
<212> DNA 
<213> Zea mays 

<400> 35 
atggacggcg gggggcagcc 20 

<210> 36 
<211> 20 
<212> DNA 
<213> Zea mays 

<400> 36 

ctactggtaa tctgggttga 20 

<210> 37 
<211> 1053 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (1) . . . (1050) 

<400> 37 

atg gcg acg cca cgg aag ccg atc aag ctc acg ctg ccg tcc cac gag 48 
Met Ala Thr Pro Arg Lys Pro Ile Lys Leu Thr Leu Pro Ser His Glu 
1 5 10 15 

acc acc atc ggc aag ttc ctg acc cac agc ggg acg ttc acg gac ggg 96 
Thr Thr Ile Gly Lys Phe Leu Thr His Ser Gly Thr Phe Thr Asp Gly 
20 25 30 

gat ctg cgc gtc aac aag gac ggc ctc cgc atc gtc tcg cgg agg gag 144 
Asp Leu Arg Val Asn Lys Asp Gly Leu Arg Ile Val Ser Arg Arg Glu 
35 40 45 

gga ggc gag gct cct cct ata gag ccg ttg gat agt caa ctg agc tta 192 
Gly Gly Glu Ala Pro Pro Ile Glu Pro Leu Asp Ser Gln Leu Ser Leu 
50 55 60 

gat gat cta gac gtt ata aaa gtg atc ggg aaa ggt agc agc gga aat 240 
Asp Asp Leu Asp Val Ile Lys Val Ile Gly Lys Gly Ser Ser Gly Asn 
65 70 75 80 

gtg caa ttg gtc cgc cac aaa ttt act ggc cag ttt ttt gct ctg aag 288 
Val Gln Leu Val Arg His Lys Phe Thr Gly Gln Phe Phe Ala Leu Lys 
85 90 95 

gtt att caa cta aat att gat gag agt ata cgc aaa cag att gcc aag 336 
Val Ile Gln Leu Asn Ile Asp Glu Ser Ile Arg Lys Gln Ile Ala Lys 
100 105 110 

gag ttg aag ata aac tta tca aca cag tgc caa tat gtt gtt gtg ttc 384 
Glu Leu Lys Ile Asn Leu Ser Thr Gln Cys Gln Tyr Val Val Val Phe 
115 120 125 

tat cag tgt ttc tat ttc aat ggt gcc att tct att gtt ttg gaa tac 432 
Tyr Gln Cys Phe Tyr Phe Asn Gly Ala Ile Ser Ile Val Leu Glu Tyr 
130 135 140 

atg gat ggt ggc tcc ctt gca gat ttc ctg aag act gtt aaa acc att 480 
Met Asp Gly Gly Ser Leu Ala Asp Phe Leu Lys Thr Val Lys Thr Ile 
145 150 155 160 

cca gag gcc tac ctc gct gct atc tgt acg cag atg cta aaa gga ctg 528 
Pro Glu Ala Tyr Leu Ala Ala Ile Cys Thr Gln Met Leu Lys Gly Leu 
165 170 175 

atc tat ttg cat aac gag aag cgc gtt ata cac cga gat ctg aaa cca 576 
Ile Tyr Leu His Asn Glu Lys Arg Val Ile His Arg Asp Leu Lys Pro 
180 185 190 

tca aat ata ttg ata aat cat agg ggt gaa gta aaa aca tca gat ttt 624 
Ser Asn Ile Leu Ile Asn His Arg Gly Glu Val Lys Ile Ser Asp Phe 
195 200 205 

ggt gtg agt gcc att ata tct agt tcc tct tcg caa cga gat aca ttt 672 
Gly Val Ser Ala Ile Ile Ser Ser Ser Ser Ser Gln Arg Asp Thr Phe 
210 215 220 

att ggc aca cgc aac tac atg gcg cca gaa aga atc gat gga aag aaa 720 
Ile Gly Thr Arg Asn Tyr Met Ala Pro Glu Arg Ile Asp Gly Lys Lys 
225 230 235 240 

cat ggt tct atg agt gat atc tgg agt ttg gga cta gtg ata ctg gaa 768 
His Gly Ser Met Ser Asp Ile Trp Ser Leu Gly Leu Val Ile Leu Glu 
245 250 255 

tgt gca acc ggc atc ttt cca ttt cct cct tgt gaa agc ttc tac gaa 816 
Cys Ala Thr Gly Ile Phe Pro Phe Pro Pro Cys Glu Ser Phe Tyr Glu 
260 265 270 

ctt ctc gtg gct gtt gtt gat caa ccg cca cct tct gcg ccg ccg gat 864 
Leu Leu Val Ala Val Val Asp Gln Pro Pro Pro Ser Ala Pro Pro Asp 
275 280 285 

cag ttt tca cca gaa ttc tgt ggg ttc att tct gca tgt ctc cag aag 912 
Gln Phe Ser Pro Glu Phe Cys Gly Phe Ile Ser Ala Cys Leu Gln Lys 
290 295 300 

gat gct aat gac agg tca tca gcc caa gcc tta ttg gac cat ccg ttc 960 
Asp Ala Asn Asp Arg Ser Ser Ala Gln Ala Leu Leu Asp His Pro Phe 
305 310 315 320 

ctg agc atg tat gat gac ctg cat gta gat ctt gct tcg tac ttc acg 1008 
Leu Ser Met Tyr Asp Asp Leu His Val Asp Leu Ala Ser Tyr Phe Thr 
325 330 335 

aca gca gga tct cct ctc gcc acc ttc aat tcc agg caa ctc 1050 
Thr Ala Gly Ser Pro Leu Ala Thr Phe Asn Ser Arg Gln Leu 
340 345 350 

taa 1053 

<210> 38 

<211> 350 

<212> PRT 

<213> Zea mays 

<400> 38 

Met Ala Thr Pro Arg Lys Pro Ile Lys Leu Thr Leu Pro Ser His Glu 
1 5 10 15 

Thr Thr Ile Gly Lys Phe Leu Thr His Ser Gly Thr Phe Thr Asp Gly 
20 25 30 

Asp Leu Arg Val Asn Lys Asp Gly Leu Arg Ile Val Ser Arg Arg Glu 
35 40 45 

Gly Gly Glu Ala Pro Pro Ile Glu Pro Leu Asp Ser Gln Leu Ser Leu 
50 55 60 

Asp Asp Leu Asp Val Ile Lys Val Ile Gly Lys Gly Ser Ser Gly Asn 
65 70 75 80 

Val Gln Leu Val Arg His Lys Phe Thr Gly Gln Phe Phe Ala Leu Lys 
85 90 95 

Val Ile Gln Leu Asn Ile Asp Glu Ser Ile Arg Lys Gln Ile Ala Lys 
100 105 110 

Glu Leu Lys Ile Asn Leu Ser Thr Gln Cys Gln Tyr Val Val Val Phe 
115 120 125 

Tyr Gln Cys Phe Tyr Phe Asn Gly Ala Ile Ser Ile Val Leu Glu Tyr 
130 135 140 

Met Asp Gly Gly Ser Leu Ala Asp Phe Leu Lys Thr Val Lys Thr Ile 
145 150 155 160 

Pro Glu Ala Tyr Leu Ala Ala Ile Cys Thr Gln Met Leu Lys Gly Leu 
165 170 175 

Ile Tyr Leu His Asn Glu Lys Arg Val Ile His Arg Asp Leu Lys Pro 
180 185 190 

Ser Asn Ile Leu Ile Asn His Arg Gly Glu Val Lys Ile Ser Asp Phe 
195 200 205 

Gly Val Ser Ala Ile Ile Ser Ser Ser Ser Ser Gln Arg Asp Thr Phe 
210 215 220 

Ile Gly Thr Arg Asn Tyr Met Ala Pro Glu Arg Ile Asp Gly Lys Lys 
225 230 235 240 

His Gly Ser Met Ser Asp Ile Trp Ser Leu Gly Leu Val Ile Leu Glu 
245 250 255 

Cys Ala Thr Gly Ile Phe Pro Phe Pro Pro Cys Glu Ser Phe Tyr Glu 
260 265 270 

Leu Leu Val Ala Val Val Asp Gln Pro Pro Pro Ser Ala Pro Pro Asp 
275 280 285 

Gln Phe Ser Pro Glu Phe Cys Gly Phe Ile Ser Ala Cys Leu Gln Lys 
290 295 300 

Asp Ala Asn Asp Arg Ser Ser Ala Gln Ala Leu Leu Asp His Pro Phe 
305 310 315 320 

Leu Ser Met Tyr Asp Asp Leu His Val Asp Leu Ala Ser Tyr Phe Thr 
325 330 335 

Thr Ala Gly Ser Pro Leu Ala Thr Phe Asn Ser Arg Gln Leu 
340 345 350 







<210> 39 
<211> 20 
<212> DNA 
<213> Zea mays 

<400> 39 

atggcgacgc cacggaagcc 20 

<210> 40 
<211> 20 
<212> DNA 
<213> Zea mays 

<400> 40 

ttagagttgc ctggaattga 20 

<210> 41 
<211> 846 
<212> DNA 
<213> Zea mays 

<220> 
<221> CDS 
<222> (1) . . . (843) 

<400> 41 

atg gct caa gag ctc cac gaa acg tcc tct tgc tct gcc acc acc acc 48 
Met Ala Gln Glu Leu His Glu Thr Ser Ser Cys Ser Ala Thr Thr Thr 
1 5 10 15 

tcg tcg tgc acc aca tcc tgc tgc tcg tcc act gtc aca gac tcg tcc 96 
Ser Ser Cys Thr Thr Ser Cys Cys Ser Ser Thr Val Thr Asp Ser Ser 
20 25 30 

tct tcg ccc ccg tca ccg gcg gcg gcc aat gcc gcg ccc gcg aca cgg 144 
Ser Ser Pro Pro Ser Pro Ala Ala Ala Asn Ala Ala Pro Ala Thr Arg 
35 40 45 

aag cgg cag gcg ttg gag gcc gag gcc gag gcc gag gcg ggc ggt gag 192 
Lys Arg Gln Ala Leu Glu Ala Glu Ala Glu Ala Glu Ala Gly Gly Glu 
50 55 60 

gag gag gag gag gag gag gga agc tgt gct ggt aat aaa gcg gcg ccg 240 
Glu Glu Glu Glu Glu Glu Gly Ser Cys Ala Gly Asn Lys Ala Ala Pro 
65 70 75 80 

gcc aag aag cga ccg cgg ggc agc gag ggg aaa cac ccg acg ttc cgc 288 
Ala Lys Lys Arg Pro Arg Gly Ser Glu Gly Lys His Pro Thr Phe Arg 
85 90 95 

ggc gtg cgg atg cgg gcg tgg ggc aag tgg gtg tcg gag atc cgc gag 336 
Gly Val Arg Met Arg Ala Trp Gly Lys Trp Val Ser Glu Ile Arg Glu 
100 105 110 

ccg cgc aag aag tcg cgc ata tgg ctc ggc acg ttc ccc acc gcc gag 384 
Pro Arg Lys Lys Ser Arg Ile Trp Leu Gly Thr Phe Pro Thr Ala Glu 
115 120 125 

atg gcc gcg cgc gcc cac gac gtc gcg gcg ctc gcc atc aaa ggc cgc 432 
Met Ala Ala Arg Ala His Asp Val Ala Ala Leu Ala Ile Lys Gly Arg 
130 135 140 

gcc gcg cac ctc aac ttc ccg gac ttc gcc ggc gcg ctc ccg cgc gcc 480 
Ala Ala His Leu Asn Phe Pro Asp Phe Ala Gly Ala Leu Pro Arg Ala 
145 150 155 160 

gcg tcc gcg gcg ccc aag gac gtc cag gca gcc gcc gca ttg gcc gct 528 
Ala Ser Ala Ala Pro Lys Asp Val Gln Ala Ala Ala Ala Leu Ala Ala 
165 170 175 

gcg ttc acg tcg ccg tca tcg gag ccc ggc gcc ggc gcg cac gag gag 576 
Ala Phe Thr Ser Pro Ser Ser Glu Pro Gly Ala Gly Ala His Glu Glu 
180 185 190 

ccc gct gcc aag gac ggc gcc gcg ccc gag gag gca gcc gcc gac gca 624 
Pro Ala Ala Lys Asp Gly Ala Ala Pro Glu Glu Ala Ala Ala Asp Ala 
195 200 205 

cag gca cca gta cca gta gca cta cca ccg ccg gcg gcc tct cgg cca 672 
Gln Ala Pro Val Pro Val Ala Leu Pro Pro Pro Ala Ala Ser Arg Pro 
210 215 220 

ggg acg ccg tcg agc ggc gtg gag gac gag cgg cag ctg ttc gac ctg 720 
Gly Thr Pro Ser Ser Gly Val Glu Asp Glu Arg Gln Leu Phe Asp Leu 
225 230 235 240 

ccg gac ctg ctc ctc gac atc cgg gac ggg ttc ggg cgc ttc ccg ccg 768 
Pro Asp Leu Leu Leu Asp Ile Arg Asp Gly Phe Gly Arg Phe Pro Pro 
245 250 255 

atg tgg gcc ccg ctc act gac gtg gag gag gtg gtc aat gcg gag ctg 816 
Met Trp Ala Pro Leu Thr Asp Val Glu Glu Val Val Asn Ala Glu Leu 
260 265 270 

cgc ctc gag gag ccg ctg ctt tgg gag tag 846 
Arg Leu Glu Glu Pro Leu Leu Trp Glu 
275 280 



<210> 42 

<211> 281 

<212> PRT 

<213> Zea mays 

<400> 42 

Met Ala Gln Glu Leu His Glu Thr Ser Ser Cys Ser Ala Thr Thr Thr 
1 5 10 15 

Ser Ser Cys Thr Thr Ser Cys Cys Ser Ser Thr Val Thr Asp Ser Ser 
20 25 30 

Ser Ser Pro Pro Ser Pro Ala Ala Ala Asn Ala Ala Pro Ala Thr Arg 
35 40 45 

Lys Arg Gln Ala Leu Glu Ala Glu Ala Glu Ala Glu Ala Gly Gly Glu 
50 55 60 

Glu Glu Glu Glu Glu Glu Gly Ser Cys Ala Gly Asn Lys Ala Ala Pro 
65 70 75 80 

Ala Lys Lys Arg Pro Arg Gly Ser Glu Gly Lys His Pro Thr Phe Arg 
85 90 95 

Gly Val Arg Met Arg Ala Trp Gly Lys Trp Val Ser Glu Ile Arg Glu 
100 105 110 

Pro Arg Lys Lys Ser Arg Ile Trp Leu Gly Thr Phe Pro Thr Ala Glu 
115 120 125 

Met Ala Ala Arg Ala His Asp Val Ala Ala Leu Ala Ile Lys Gly Arg 
130 135 140 

Ala Ala His Leu Asn Phe Pro Asp Phe Ala Gly Ala Leu Pro Arg Ala 
145 150 155 160 

Ala Ser Ala Ala Pro Lys Asp Val Gln Ala Ala Ala Ala Leu Ala Ala 
165 170 175 

Ala Phe Thr Ser Pro Ser Ser Glu Pro Gly Ala Gly Ala His Glu Glu 
180 185 190 

Pro Ala Ala Lys Asp Gly Ala Ala Pro Glu Glu Ala Ala Ala Asp Ala 
195 200 205 

Gln Ala Pro Val Pro Val Ala Leu Pro Pro Pro Ala Ala Ser Arg Pro 
210 215 220 

Gly Thr Pro Ser Ser Gly Val Glu Asp Glu Arg Gln Leu Phe Asp Leu 
225 230 235 240 

Pro Asp Leu Leu Leu Asp Ile Arg Asp Gly Phe Gly Arg Phe Pro Pro 
245 250 255 

Met Trp Ala Pro Leu Thr Asp Val Glu Glu Val Val Asn Ala Glu Leu 
260 265 270 

Arg Leu Glu Glu Pro Leu Leu Trp Glu 
275 280 

<210> 43 

<211> 20 

<212> DNA 

<213> Zea mays 

<400> 43 

atggctcaag agctccacga 20 

<210> 44 
<211> 20 
<212> DNA 
<213> Zea mays 

<400> 44 

ctactcccaa agcagcggct 20 

<210> 45 

<211> 2257 

<212> DNA 

<213> Zea mays 

<220> 
<221> CDS 

<222> (108) . . . (1844) 
<400> 45 

gtcgacccac gcgtccgtgc ttggagcggg tggacgtgtg gtgtgtatat agttaagcgg 60 
gcgggcgggc gggcagccaa tgccgtcggg cgcgcagggc taggagc atg ggc gga 116 
Met Gly Gly 
1 

ggg ggc gcg atc gtg cac ggc ttc cgc cgc tgg ttc cac cgc cgg agc 164 
Gly Gly Ala Ile Val His Gly Phe Arg Arg Trp Phe His Arg Arg Ser 
5 10 15 

gaa ggc tcc acc tcc aac tcc aac cag tcg tcc ctc gcc ggc gag ggg 212 
Glu Gly Ser Thr Ser Asn Ser Asn Gln Ser Ser Leu Ala Gly Glu Gly 
20 25 30 35 

gaa gcg gac gac ggc tca tcc gat ctc gag gtc atc gag gac ccg gat 260 
Glu Ala Asp Asp Gly Ser Ser Asp Leu Glu Val Ile Glu Asp Pro Asp 
40 45 50 

ctc ctc ggc ctc cgc gcc atc cgc gtg ccc aag cgc aag atg ccg ctg 308 
Leu Leu Gly Leu Arg Ala Ile Arg Val Pro Lys Arg Lys Met Pro Leu 
55 60 65 

ccc gtc gag agc cac agg aag aac tca gta gaa atg gag ttc ttc aca 356 
Pro Val Glu Ser His Arg Lys Asn Ser Val Glu Met Glu Phe Phe Thr 
70 75 80 

gag tac gga gag gca agc cag tac caa atc caa gaa gtc att ggt aaa 404 
Glu Tyr Gly Glu Ala Ser Gln Tyr Gln Ile Gln Glu Val Ile Gly Lys 
85 90 95 

ggg agc tac gga gta gtt gct gca gct gtg gac acc cgc acc ggt gag 452 
Gly Ser Tyr Gly Val Val Ala Ala Ala Val Asp Thr Arg Thr Gly Glu 
100 105 110 115 

cgg gtt gct atc aag aag atc aat gac gtg ttt gag cac gtc tcg gat 500 
Arg Val Ala Ile Lys Lys Ile Asn Asp Val Phe Glu His Val Ser Asp 
120 125 130 

gcc aca cgc ata ctg cgg gag atc aag ctc ctc cgg ttg ctt cgt cac 548 
Ala Thr Arg Ile Leu Arg Glu Ile Lys Leu Leu Arg Leu Leu Arg His 
135 140 145 

cca gac ata gtc gag att aag cac atc atg ctc cct cct tct cgg agg 596 
Pro Asp Ile Val Glu Ile Lys His Ile Met Leu Pro Pro Ser Arg Arg 
150 155 160 

gag ttc caa gac atc tat gtt gtt ttt gag ctc atg gag tcg gat ctc 644 
Glu Phe Gln Asp Ile Tyr Val Val Phe Glu Leu Met Glu Ser Asp Leu 
165 170 175 

cat caa gtg acc agg gcg aat gat gac ctc acc cct gaa cat cac cag 692 
His Gln Val Ile Arg Ala Asn Asp Asp Leu Thr Pro Glu His His Gln 
180 185 190 195 

ttt ttc ctg tac caa ctt ctt cga gct ctc aag tac ata cat gcc gct 740 
Phe Phe Leu Tyr Gln Leu Leu Arg Ala Leu Lys Tyr Ile His Ala Ala 
200 205 210 

aat gta ttc cat cgt gat cta aag ccc aag aat ata ctg gct aac tcg 788 
Asn Val Phe His Arg Asp Leu Lys Pro Lys Asn Ile Leu Ala Asn Ser 
215 220 225 

gac tgc aaa ctg aaa ata tgt gac ttt gga ctt gcc cgt gca tca ttc 836 
Asp Cys Lys Leu Lys Ile Cys Asp Phe Gly Leu Ala Arg Ala Ser Phe 
230 235 240 

aat gat gcc cct tca gct ata ttt tgg aca gac tat gtg gca aca agg 884 
Asn Asp Ala Pro Ser Ala Ile Phe Trp Thr Asp Tyr Val Ala Thr Arg 
245 250 255 

tgg tac cgg gca cct gaa cta tgt ggt tcc ttt ttc tcc aaa tac act 932 
Trp Tyr Arg Ala Pro Glu Leu Cys Gly Ser Phe Phe Ser Lys Tyr Thr 
260 265 270 275 

cct gca att gat att tgg agc att ggg tgc ata ttt gct gaa ctt ctc 980 
Pro Ala Ile Asp Ile Trp Ser Ile Gly Cys Ile Phe Ala Glu Leu Leu 
280 285 290 

act gga agg cca cta ttt cct ggg aag aat gtt gta cac caa tta gat 1028 
Thr Gly Arg Pro Leu Phe Pro Gly Lys Asn Val Val His Gln Leu Asp 
295 300 305 

att ata aca gat ctt ctt ggg act cca tca tct gag acc tta tcc cgg 1076 
Ile Ile Thr Asp Leu Leu Gly Thr Pro Ser Ser Glu Thr Leu Ser Arg 
310 315 320 

att cga aat gag aag gcc agg aga tat ctg agc tgc atg cgg aaa aaa 1124 
Ile Arg Asn Glu Lys Ala Arg Arg Tyr Leu Ser Cys Met Arg Lys Lys 
325 330 335 

tat ccc gtg ccc ttt act cat aaa ttt cga cat gct gat cct ttg gct 1172 
Tyr Pro Val Pro Phe Thr His Lys Phe Arg His Ala Asp Pro Leu Ala 
340 345 350 355 

ctt cgt ctt cta gag cgt ttg ctt gca ttt gat cct aaa gac cgg cct 1220 
Leu Arg Leu Leu Glu Arg Leu Leu Ala Phe Asp Pro Lys Asp Arg Pro 
360 365 370 

agt gca gaa gag gct cta gct gat cca tac ttt gca tcc ctt gct aat 1268 
Ser Ala Glu Glu Ala Leu Ala Asp Pro Tyr Phe Ala Ser Leu Ala Asn 
375 380 385 

gtg gaa cgt gag cca tca aga cac ccg att tct aaa ctt gag ttt gaa 1316 
Val Glu Arg Glu Pro Ser Arg His Pro Ile Ser Lys Leu Glu Phe Glu 
390 395 400 

ttt gaa aga aag aag ctg gcc aaa gat gat gtt aga gaa ctg atc tat 1364 
Phe Glu Arg Lys Lys Leu Ala Lys Asp Asp Val Arg Glu Leu Ile Tyr 
405 410 415 

cga gag att ttg gag tat cat cca cag atg ctg gag gag tac atg aaa 1412 
Arg Glu Ile Leu Glu Tyr His Pro Gln Met Leu Glu Glu Tyr Met Lys 
420 425 430 435 

ggc gga gaa cag att agc ttc ctc tat cca agc gga gtc gac cgc ttc 1460 
Gly Gly Glu Gln Ile Ser Phe Leu Tyr Pro Ser Gly Val Asp Arg Phe 
440 445 450 

aaa ctg cag ttc gct cat ctt gag gag cat tac agc aaa gga gaa aga 1508 
Lys Leu Gln Phe Ala His Leu Glu Glu His Tyr Ser Lys Gly Glu Arg 
455 460 465 

ggt tcc cca ctg caa aga aag cat gct tcc tta cca agg gaa aga gtg 1556 
Gly Ser Pro Leu Gln Arg Lys His Ala Ser Leu Pro Arg Glu Arg Val 
470 475 480 

gtt gta tcc aag gat ggc aat act gaa caa caa att aat gac cag gag 1604 
Val Val Ser Lys Asp Gly Asn Thr Glu Gln Gln Ile Asn Asp Gln Glu 
485 490 495 

aga agc gca gat tct gtt gcc cgc act acc gtg agc cct cca agg tca 1652 
Arg Ser Ala Asp Ser Val Ala Arg Thr Thr Val Ser Pro Pro Arg Ser 
500 505 510 515 

gag gat gct gac atg gat gat gtg aaa ccc aca agc tta agc tct cgg 1700 
Glu Asp Ala Asp Met Asp Asp Val Lys Pro Thr Ser Leu Ser Ser Arg 
520 525 530 

agc tac ctg aag agt gca agc ata agc gct tcc aag tgc gtc gtt gtc 1748 
Ser Tyr Leu Lys Ser Ala Ser Ile Ser Ala Ser Lys Cys Val Val Val 
535 540 545 

aga aat aaa cac cca gag cca gag gat gat gga ata cct gag gag atg 1796 
Arg Asn Lys His Pro Glu Pro Glu Asp Asp Gly Ile Pro Glu Glu Met 
550 555 560 

gaa ggc acc gtc gat gga ttg tct gaa aag gtc tcc agg atg cac tcc 1844 
Glu Gly Thr Val Asp Gly Leu Ser Glu Lys Val Ser Arg Met His Ser 
565 570 575 

taaggcagga ctgtccatgg cgacagcaat ctcgacacaa gatttctgtc tggtgcactg 1904 
taatgccgac gacacactgt ttggatcttg tgtacaccca cagctagttc aactaagatg 1964 
tagcactagt tcgtccatgg aggaagcaga atggacgatg aagtgtggtc agcctgtcga 2024 
ccatgaggtg ttttcttcca atggagtcta gggratgttt gagttctaca acggcttgtg 2084 
ccgtacgaag tgtcagcagt gtacgaggag gaggatatcg ttgtcagtac acattttgcc 2144 
tgtctagaaa attatgagta ctttctccaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2204 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaagggcggc cgc 2257 



<210> 46 
<211> 579 
<212> PRT 
<213> Zea mays 

<400> 46 

Met Gly Gly Gly Gly Ala Ile Val His Gly Phe Arg Arg Trp Phe His 
1 5 10 15 

Arg Arg Ser Glu Gly Ser Thr Ser Asn Ser Asn Gln Ser Ser Leu Ala 
20 25 30 

Gly Glu Gly Glu Ala Asp Asp Gly Ser Ser Asp Leu Glu Val Ile Glu 
35 40 45 

Asp Pro Asp Leu Leu Gly Leu Arg Ala Ile Arg Val Pro Lys Arg Lys 
50 55 60 

Met Pro Leu Pro Val Glu Ser His Arg Lys Asn Ser Val Glu Met Glu 
65 70 75 80 

Phe Phe Thr Glu Tyr Gly Glu Ala Ser Gln Tyr Gln Ile Gln Glu Val 
85 90 95 

Ile Gly Lys Gly Ser Tyr Gly Val Val Ala Ala Ala Val Asp Thr Arg 
100 105 110 

Thr Gly Glu Arg Val Ala Ile Lys Lys Ile Asn Asp Val Phe Glu His 
115 120 125 

Val Ser Asp Ala Thr Arg Ile Leu Arg Glu Ile Lys Leu Leu Arg Leu 
130 135 140 

Leu Arg His Pro Asp Ile Val Glu Ile Lys His Ile Met Leu Pro Pro 
145 150 155 160 

Ser Arg Arg Glu Phe Gln Asp Ile Tyr Val Val Phe Glu Leu Met Glu 
165 170 175 

Ser Asp Leu His Gln Val Ile Arg Ala Asn Asp Asp Leu Thr Pro Glu 
180 185 190 

His His Gln Phe Phe Leu Tyr Gln Leu Leu Arg Ala Leu Lys Tyr Ile 
195 200 205 

His Ala Ala Asn Val Phe His Arg Asp Leu Lys Pro Lys Asn Ile Leu 
210 215 220 

Ala Asn Ser Asp Cys Lys Leu Lys Ile Cys Asp Phe Gly Leu Ala Arg 
225 230 235 240 

Ala Ser Phe Asn Asp Ala Pro Ser Ala Ile Phe Trp Thr Asp Tyr Val 
245 250 255 

Ala Thr Arg Trp Tyr Arg Ala Pro Glu Leu Cys Gly Ser Phe Phe Ser 
260 265 270 

Lys Tyr Thr Pro Ala Ile Asp Ile Trp Ser Ile Gly Cys Ile Phe Ala 
275 280 285 

Glu Leu Leu Thr Gly Arg Pro Leu Phe Pro Gly Lys Asn Val Val His 
290 295 300 

Gln Leu Asp Ile Ile Thr Asp Leu Leu Gly Thr Pro Ser Ser Glu Thr 
305 310 315 320 

Leu Ser Arg Ile Arg Asn Glu Lys Ala Arg Arg Tyr Leu Ser Cys Met 
325 330 335 

Arg Lys Lys Tyr Pro Val Pro Phe Thr His Lys Phe Arg His Ala Asp 
340 345 350 

Pro Leu Ala Leu Arg Leu Leu Glu Arg Leu Leu Ala Phe Asp Pro Lys 
355 360 365 

Asp Arg Pro Ser Ala Glu Glu Ala Leu Ala Asp Pro Tyr Phe Ala Ser 
370 375 380 

Leu Ala Asn Val Glu Arg Glu Pro Ser Arg His Pro Ile Ser Lys Leu 
385 390 395 400 

Glu Phe Glu Phe Glu Arg Lys Lys Leu Ala Lys Asp Asp Val Arg Glu 
405 410 415 

Leu Ile Tyr Arg Glu Ile Leu Glu Tyr His Pro Gln Met Leu Glu Glu 
420 425 430 

Tyr Met Lys Gly Gly Glu Gln Ile Ser Phe Leu Tyr Pro Ser Gly Val 
435 440 445 

Asp Arg Phe Lys Leu Gln Phe Ala His Leu Glu Glu His Tyr Ser Lys 
450 455 460 

Gly Glu Arg Gly Ser Pro Leu Gln Arg Lys His Ala Ser Leu Pro Arg 
465 470 475 480 

Glu Arg Val Val Val Ser Lys Asp Gly Asn Thr Glu Gln Gln Ile Asn 
485 490 495 

Asp Gln Glu Arg Ser Ala Asp Ser Val Ala Arg Thr Thr Val Ser Pro 
500 505 510 

Pro Arg Ser Glu Asp Ala Asp Met Asp Asp Val Lys Pro Thr Ser Leu 
515 520 525 

Ser Ser Arg Ser Tyr Leu Lys Ser Ala Ser Ile Ser Ala Ser Lys Cys 
530 535 540 

Val Val Val Arg Asn Lys His Pro Glu Pro Glu Asp Asp Gly Ile Pro 
545 550 555 560 

Glu Glu Met Glu Gly Thr Val Asp Gly Leu Ser Glu Lys Val Ser Arg 






565 570 575 



Met His Ser 



<210> 47 

<211> 20 

<212> DNA 

<213> Zea mays 

<400> 47 

atgggcggag ggggcgcgat 20 

<210> 48 
<211> 20 
<212> DNA 
<213> Zea mays 

<400> 48 

ttaggagtgc atcctggaga 20 

<210> 49 
<211> 938 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (39) . . . (743) 

<400> 49 

gaattcggca cgagcagaaa gcgaagcacc agcctctg atg cca agc caa agc aca 56 
Met Pro Ser Gln Ser Thr 
1 5 

cac acc tcc tcc agc ctt tct tcc tct aca acc cca att ctt ccc tgc 104 
His Thr Ser Ser Ser Leu Ser Ser Ser Thr Thr Pro Ile Leu Pro Cys 
10 15 20 

agc tct gct gct gcc ttg ttt cga tcg gtc atg gcg gcg gtg gcc aca 152 
Ser Ser Ala Ala Ala Leu Phe Arg Ser Val Met Ala Ala Val Ala Thr 
25 30 35 

gag act cca ttc cat gtc ctg gcg gtg gac gac agc ctc ccg gac agg 200 
Glu Thr Pro Phe His Val Leu Ala Val Asp Asp Ser Leu Pro Asp Arg 
40 45 50 

aag ctc atc gag agg ctc ctc aag acc tct tcc ttc caa gtg acc act 248 
Lys Leu Ile Glu Arg Leu Leu Lys Thr Ser Ser Phe Gln Val Thr Thr 
55 60 65 70 

gtc gac tcc ggg agc aag gcg ctg cag ttc ctg ggc ctc cat gac cag 296 
Val Asp Ser Gly Ser Lys Ala Leu Gln Phe Leu Gly Leu His Asp Gln 
75 80 85 

gac agc acg gtt cct cct gtc cac acg cac cag ctg gat gtg gct gcc 344 
Asp Ser Thr Val Pro Pro Val His Thr His Gln Leu Asp Val Ala Ala 
90 95 100 

aat cag gat gtg gct gtg aac ctg atc atc aca gac tac tgc atg cct 392 
Asn Gln Asp Val Ala Val Asn Leu Ile Ile Thr Asp Tyr Cys Met Pro 
105 110 115 

ggc atg aca gga tat gac ctg ctc aag aag atc aag gag tcg tcg tct 440 
Gly Met Thr Gly Tyr Asp Leu Leu Lys Lys Ile Lys Glu Ser Ser Ser 
120 125 130 

ctc aga gat atc ccg gtg gtg atc atg tcc tct gag aac att cct tca 488 
Leu Arg Asp Ile Pro Val Val Ile Met Ser Ser Glu Asn Ile Pro Ser 
135 140 145 150 

agg atc aat agg tgc ctg gag gaa gga gct gac gag ttc ttc cta aaa 536 
Arg Ile Asn Arg Cys Leu Glu Glu Gly Ala Asp Glu Phe Phe Leu Lys 
155 160 165 

cct gtg cgg cta tca gac atg aac aag ctc aag ccc cac ata ctg aaa 584 
Pro Val Arg Leu Ser Asp Met Asn Lys Leu Lys Pro His Ile Leu Lys 
170 175 180 

agc aga tgc aac cag gaa cag cac cag caa agt gac agt cac agt ggc 632 
Ser Arg Cys Asn Gln Glu Gln His Gln Gln Ser Asp Ser His Ser Gly 
185 190 195 

gaa cgc agg aac ccc aca atc agc agc agc gat agc ata aac aac cgc 680 
Glu Arg Arg Asn Pro Thr Ile Ser Ser Ser Asp Ser Ile Asn Asn Arg 
200 205 210 

aag aga aag ggt gca ggc acc gaa gaa atc ttg ccc cag ctg gca aac 728 
Lys Arg Lys Gly Ala Gly Thr Glu Glu Ile Leu Pro Gln Leu Ala Asn 
215 220 225 230 

aga tca agg cac agt taactgagaa ctgactagta cagctaaaac cttttctttt 783 
Arg Ser Arg His Ser 
235 

tcactttact ttggcttact gatttgtaac gtagatgtta gccctggatt tgcaaaacgg 843 

aacggaatct gtaaactgat actgcttgag tcgaatcgat cgaagtgatt ctgcaccaga 903 

tctccctaaa aaaaaaaaaa aaaaaaaaac tcgag 938 

<210> 50 
<211> 235 
<212> PRT 
<213> Zea mays 

<400> 50 

Met Pro Ser Gln Ser Thr His Thr Ser Ser Ser Leu Ser Ser Ser Thr 
1 5 10 15 

Thr Pro Ile Leu Pro Cys Ser Ser Ala Ala Ala Leu Phe Arg Ser Val 
20 25 30 

Met Ala Ala Val Ala Thr Glu Thr Pro Phe His Val Leu Ala Val Asp 
35 40 45 

Asp Ser Leu Pro Asp Arg Lys Leu Ile Glu Arg Leu Leu Lys Thr Ser 
50 55 60 

Ser Phe Gln Val Thr Thr Val Asp Ser Gly Ser Lys Ala Leu Gln Phe 
65 70 75 80 

Leu Gly Leu His Asp Gln Asp Ser Thr Val Pro Pro Val His Thr His 
85 90 95 

Gln Leu Asp Val Ala Ala Asn Gln Asp Val Ala Val Asn Leu Ile Ile 
100 105 110 

Thr Asp Tyr Cys Met Pro Gly Met Thr Gly Tyr Asp Leu Leu Lys Lys 
115 120 125 

Ile Lys Glu Ser Ser Ser Leu Arg Asp Ile Pro Val Val Ile Met Ser 
130 135 140 

Ser Glu Asn Ile Pro Ser Arg Ile Asn Arg Cys Leu Glu Glu Gly Ala 
145 150 155 160 

Asp Glu Phe Phe Leu Lys Pro Val Arg Leu Ser Asp Met Asn Lys Leu 
165 170 175 

Lys Pro His Ile Leu Lys Ser Arg Cys Asn Gln Glu Gln His Gln Gln 
180 185 190 

Ser Asp Ser His Ser Gly Glu Arg Arg Asn Pro Thr Ile Ser Ser Ser 
195 200 205 

Asp Ser Ile Asn Asn Arg Lys Arg Lys Gly Ala Gly Thr Glu Glu Ile 
210 215 220 

Leu Pro Gln Leu Ala Asn Arg Ser Arg His Ser 
225 230 235 

<210> 51 
<211> 20 
<212> DNA 
<213> Zea mays 

<400> 51 

atgccaagcc aaagcacaca 20 

<210> 52 
<211> 20 
<212> DNA 
<213> Zea mays 

<400> 52 

ttaactgtgc cttgatctgt 20 

<210> 53 
<211> 1322 
<212> DNA 
<213> Zea mays 

<220> 
<221> CDS 

<222> (217)...(864)
<400> 53 

gtcgacccac gcgtccggta aaaagccacc caccctaccc tcctgcgctg gtactactta 60
ctacctagcc aggccaagga gcccaacgag ggggaacgct gcggtaggcg cgcgccgctc 120
gtcccacccc acgccgccat gacggtgcca gangccgagt cgcgcttcca tgtcctcgcg 180
gtggacgaca gcctcgtcga caggaaaccc atcgag acg ctg ctc aag acc tcg 234

Met Leu Leu Lys Thr Ser
1 5

tcc tac caa gtg acc acg gtg gat tcc ggg agc aag gcg ctg gag ctg 282
Ser Tyr Gln Val Thr Thr Val Asp Ser Gly Ser Lys Ala Leu Glu Leu
10 15 20

ctg ggg ctg agg gac gcg tcg tcg ccg tct ccg tcc tcg cct gac cac 330
Leu Gly Leu Arg Asp Ala Ser Ser Pro Ser Pro Ser Ser Pro Asp His
25 30 35

cag gag atc gac gtg aat ctc atc atc act gac tac tgc atg cca ggc 378
Gln Glu Ile Asp Val Asn Leu Ile Ile Thr Asp Tyr Cys Met Pro Gly
40 45 50

atg aca gga tac gat ctg ctc aag aga atg aag ggg tcc tct tcg ctc 426
Met Thr Gly Tyr Asp Leu Leu Lys Arg Met Lys Gly Ser Ser Ser Leu
55 60 65 70

aag gac att cct gtg gtg atc atg tcg tct gag aat gtg cct gcc cgg 474
Lys Asp Ile Pro Val Val Ile Met Ser Ser Glu Asn Val Pro Ala Arg
75 80 85

atc agc agg tgc ttg caa gac ggc gcg gag gag ttc ttc ctg aag ccc 522
Ile Ser Arg Cys Leu Gln Asp Gly Ala Glu Glu Phe Phe Leu Lys Pro
90 95 100

gtg aag ctg gcc gac atg aag aag ctc aag tcg cac ctg ctg aaa cgg 570
Val Lys Leu Ala Asp Met Lys Lys Leu Lys Ser His Leu Leu Lys Arg
105 110 115

aag cag ccc aag gag gcg cag gcg cag gcg cag gcg cag gcg cag cag 618
Lys Gln Pro Lys Glu Ala Gln Ala Gln Ala Gln Ala Gln Ala Gln Gln
120 125 130

gga cag gcg gtg gag ctg gag cct gag cag cag ctg gac ccg cgc acg 666
Gly Gln Ala Val Glu Leu Glu Pro Glu Gln Gln Leu Asp Pro Arg Thr
135 140 145 150

cag ccg gcg cac gac gcg gag gaa acc gcg gca gag ccg ccg ccg gcc 714
Gln Pro Ala His Asp Ala Glu Glu Thr Ala Ala Glu Pro Pro Pro Ala
155 160 165

gca tcc aac gga acc acc gat ggc ggc aac aag agg aag gcg gca gcc 762
Ala Ser Asn Gly Thr Thr Asp Gly Gly Asn Lys Arg Lys Ala Ala Ala
170 175 180

atg gag gag gag ggg atg ctg gcc gtg atg acg gtg gcg gcg ccg gag 810
Met Glu Glu Glu Gly Met Leu Ala Val Met Thr Val Ala Ala Pro Glu
185 190 195

agc agc acc aag ccg agg ctg tcc acc acc acc agc agc ctg gcg gtg 858
Ser Ser Thr Lys Pro Arg Leu Ser Thr Thr Thr Ser Ser Leu Ala Val
200 205 210

gaa acc tgagctgaga atcggacggc gaccggatca catattacta cctacgtaca 914
Glu Thr
215

ttaatccacc attgcgggca gtegagaggc aaccaaccaa ecaagaacca accgaecaaa 974
cccaggccac cgccgagaaa gagctcgtce tcettttctt tcgctggttc ttttggttct 1034
cggagctaat taatggaatg aaggtgtgag ctcetgggga tggaggggcg gttacgggct 1094
ctaagtgaag aacaattacg acgtacetta cetageggct gctgcttcgg ccggeaaatt 1154
ttgttgtgct cgtggtcatg gacacccttg tgttcgttag gagggggteg 999tggaacg 1214
gtgaaaagga agctgtagat agaaaaggca ggegtctcat gctcttgttg tetagtctaa 1274
tttttaattt tactcggctt attaaaaaaa aaaaaaaagg gcggccgc 1322



<210> 54 

<211> 216 

<212> PRT 

<213> Zea mays 



<400> 54 

Met Leu Leu Lys Thr Ser Ser Tyr Gln Val Thr Thr Val Asp Ser Gly
1 5 10 15

Ser Lys Ala Leu Glu Leu Leu Gly Leu Arg Asp Ala Ser Ser Pro Ser
20 25 30

Pro Ser Ser Pro Asp His Gln Glu Ile Asp Val Asn Leu Ile Ile Thr
35 40 45

Asp Tyr Cys Met Pro Gly Met Thr Gly Tyr Asp Leu Leu Lys Arg Met
50 55 60

Lys Gly Ser Ser Ser Leu Lys Asp Ile Pro Val Val Ile Met Ser Ser
65 70 75 80






Glu Asn Val Pro Ala Arg Ile Ser Arg Cys Leu Gln Asp Gly Ala Glu
85 90 95

Glu Phe Phe Leu Lys Pro Val Lys Leu Ala Asp Met Lys Lys Leu Lys
100 105 110

Ser His Leu Leu Lys Arg Lys Gln Pro Lys Glu Ala Gln Ala Gln Ala
115 120 125

Gln Ala Gln Ala Gln Gln Gly Gln Ala Val Glu Leu Glu Pro Glu Gln
130 135 140

Gln Leu Asp Pro Arg Thr Gln Pro Ala His Asp Ala Glu Glu Thr Ala
145 150 155 160

Ala Glu Pro Pro Pro Ala Ala Ser Asn Gly Thr Thr Asp Gly Gly Asn
165 170 175

Lys Arg Lys Ala Ala Ala Met Glu Glu Glu Gly Met Leu Ala Val Met
180 185 190

Thr Val Ala Ala Pro Glu Ser Ser Thr Lys Pro Arg Leu Ser Thr Thr
195 200 205

Thr Ser Ser Leu Ala Val Glu Thr
210 215

<210> 55
<211> 20
<212> DNA
<213> Zea mays

<400> 55 

atgctgctca agacctcgtc 20 

<210> 56 

<211> 20 

<212> DNA 

<213> Zea mays 

<400> 56 

tcaggtttcc accgccaggc 20 

<210> 57 

<211> 1635 

<212> DNA 

<213> Zea mays 

<220> 
<221> CDS 

<222> (1)...(1635)
<400> 57 

atg gat ttt ttc aca gag tac ggc gag ggg agc agg tac aag ata gaa 48
Met Asp Phe Phe Thr Glu Tyr Gly Glu Gly Ser Arg Tyr Lys Ile Glu
1 5 10 15

gag gtc ata ggg aaa ggc agt tac ggt gtg gtt tgc tct gcc ttg gac 96
Glu Val Ile Gly Lys Gly Ser Tyr Gly Val Val Cys Ser Ala Leu Asp
20 25 30

act cat act ggt gaa aag gtc gcc aca aag aag ata aat gac atc ttt 144
Thr His Thr Gly Glu Lys Val Ala Ile Lys Lys Ile Asn Asp Ile Phe
35 40 45

gag cat gtg tct gat gcg aca agg ata ctt cgg gaa atc aag ttg ctt 192
Glu His Val Ser Asp Ala Thr Arg Ile Leu Arg Glu Ile Lys Leu Leu
50 55 60

aga ctc ctg cgg cat ccg gat atc gtg gaa ata aag cat att cta ctt 240
Arg Leu Leu Arg His Pro Asp Ile Val Glu Ile Lys His Ile Leu Leu
65 70 75 80

cct cca tca agg agg gaa ttc agg gat att tac gtt gtg ttt gaa ctt 288
Pro Pro Ser Arg Arg Glu Phe Arg Asp Ile Tyr Val Val Phe Glu Leu
85 90 95

atg gag tct gat ttg cac caa gtt ata aag gct aat gat gac ttg acc 336
Met Glu Ser Asp Leu His Gln Val Ile Lys Ala Asn Asp Asp Leu Thr
100 105 110

cca gag cat tat cag ttc ttc ttg tat cag ttg ctt cga ggt ttg aaa 384
Pro Glu His Tyr Gln Phe Phe Leu Tyr Gln Leu Leu Arg Gly Leu Lys
115 120 125

tac ata cac aca gca aat gta ttc cat cga gat ctc aaa cca aaa aat 432
Tyr Ile His Thr Ala Asn Val Phe His Arg Asp Leu Lys Pro Lys Asn
130 135 140

att ctg gca aat gct gat tgt aag ctc aaa ata tgt gat ttt ggc cta 480
Ile Leu Ala Asn Ala Asp Cys Lys Leu Lys Ile Cys Asp Phe Gly Leu
145 150 155 160

gca aga gta gct ttc agt gat act cca acc gcg atc ttt tgg aca gat 528
Ala Arg Val Ala Phe Ser Asp Thr Pro Thr Ala Ile Phe Trp Thr Asp
165 170 175

tat gtt gca act agg tgg tac cga gca ccg gag ctg tgt gga tca ttt 576
Tyr Val Ala Thr Arg Trp Tyr Arg Ala Pro Glu Leu Cys Gly Ser Phe
180 185 190

ttc tca aag tat aca cca gcg ata gat ata tgg agt att ggt tgt ata 624
Phe Ser Lys Tyr Thr Pro Ala Ile Asp Ile Trp Ser Ile Gly Cys Ile
195 200 205

ttt gcg gaa cta tta act gga aaa ccg ctt ttc ccg ggg aaa aat gtg 672
Phe Ala Glu Leu Leu Thr Gly Lys Pro Leu Phe Pro Gly Lys Asn Val
210 215 220

gta cat caa ctt gat ata att aca gat ctc atg gga aca cct tct cca 720
Val His Gln Leu Asp Ile Ile Thr Asp Leu Met Gly Thr Pro Ser Pro
225 230 235 240

gaa gca ata tcg agg att cgt aat gag aag gcc agg cgc tac ttg agt 768
Glu Ala Ile Ser Arg Ile Arg Asn Glu Lys Ala Arg Arg Tyr Leu Ser
245 250 255

aat atg agg cgg aaa aag ccc ata cct ttt aca cag aag ttc cca aat 816
Asn Met Arg Arg Lys Lys Pro Ile Pro Phe Thr Gln Lys Phe Pro Asn
260 265 270

gca gat cca ctt gca tta ggt ttg tta gag cga atg cta gct ttt gag 864
Ala Asp Pro Leu Ala Leu Gly Leu Leu Glu Arg Met Leu Ala Phe Glu
275 280 285

cca aaa gac agg cca agt gct gaa gag gct ctt gct cat cct tat ttt 912
Pro Lys Asp Arg Pro Ser Ala Glu Glu Ala Leu Ala His Pro Tyr Phe
290 295 300

aag aat ata gcc aat gtg gat aga gag cct tct gca caa gct gtc aca 960
Lys Asn Ile Ala Asn Val Asp Arg Glu Pro Ser Ala Gln Ala Val Thr
305 310 315 320

aag cnc gaa ttt gag ttc gag aga cgg aga gtc act aaa gat gac ata 1008
Lys Leu Glu Phe Glu Phe Glu Arg Arg Arg Val Thr Lys Asp Asp Ile
325 330 335

agg gaa ctt ata tat aga gaa att ctt gag tat cat cca aag atg ttg 1056
Arg Glu Leu Ile Tyr Arg Glu Ile Leu Glu Tyr His Pro Lys Met Leu
340 345 350

aaa gaa ttt att gag gga act gag tca agt ggt ttc atg tac cca agt 1104
Lys Glu Phe Ile Glu Gly Thr Glu Ser Ser Gly Phe Met Tyr Pro Ser
355 360 365

gca gta gat aat ttt aaa aag cag ttt gca tac ctt gag gaa cat tat 1152
Ala Val Asp Asn Phe Lys Lys Gln Phe Ala Tyr Leu Glu Glu His Tyr
370 375 380

gcc aag gga tct acg gga act cca cct gag agg caa cat aac tca ttg 1200
Ala Lys Gly Ser Thr Gly Thr Pro Pro Glu Arg Gln His Asn Ser Leu
385 390 395 400

cca cgg cct tct gtt gtc tat tca gac aac caa tca caa agc acg gcc 1248
Pro Arg Pro Ser Val Val Tyr Ser Asp Asn Gln Ser Gln Ser Thr Ala
405 410 415

aat ata aca gag gat ctt tcc aag tgt ata atc aga gaa aat gca cag 1296
Asn Ile Thr Glu Asp Leu Ser Lys Cys Ile Ile Arg Glu Asn Ala Gln
420 425 430

aag tca cag caa tat aat gct tca gtt gca aat aaa ttt cca tcc cat 1344
Lys Ser Gln Gln Tyr Asn Ala Ser Val Ala Asn Lys Phe Pro Ser His
435 440 445

gtt cct caa ggt gct gct gca agg cct ggt aaa gca gtt ggt tcg gtg 1392
Val Pro Gln Gly Ala Ala Ala Arg Pro Gly Lys Ala Val Gly Ser Val
450 455 460

ctg cag tac aac ctc tct caa gca tct gct gct gag caa tat gag cag 1440
Leu Gln Tyr Asn Leu Ser Gln Ala Ser Ala Ala Glu Gln Tyr Glu Gln
465 470 475 480

cga agg gtt gct agg cac cca gca gtt gcc cca aac aac atc cct tct 1488
Arg Arg Val Ala Arg His Pro Ala Val Ala Pro Asn Asn Ile Pro Ser
485 490 495

gga agc tca tac ccc aga aga aat cag acc tgc aag agc gag aca ggc 1536
Gly Ser Ser Tyr Pro Arg Arg Asn Gln Thr Cys Lys Ser Glu Thr Gly
500 505 510

gac act gag agg atg gac gtg aac caa gct ggg cag ccg aag tcc tat 1584
Asp Thr Glu Arg Met Asp Val Asn Gln Ala Gly Gln Pro Lys Ser Tyr
515 520 525

gca gca aac aaa cta cct gca act gtt gat ggt cgc ggc ggc cat tgg 1632
Ala Ala Asn Lys Leu Pro Ala Thr Val Asp Gly Arg Gly Gly His Trp
530 535 540

tag 1635

<210> 58
<211> 544
<212> PRT
<213> Zea mays

<400> 58

Met Asp Phe Phe Thr Glu Tyr Gly Glu Gly Ser Arg Tyr Lys Ile Glu
1 5 10 15

Glu Val Ile Gly Lys Gly Ser Tyr Gly Val Val Cys Ser Ala Leu Asp
20 25 30

Thr His Thr Gly Glu Lys Val Ala Ile Lys Lys Ile Asn Asp Ile Phe
35 40 45

Glu His Val Ser Asp Ala Thr Arg Ile Leu Arg Glu Ile Lys Leu Leu
50 55 60

Arg Leu Leu Arg His Pro Asp Ile Val Glu Ile Lys His Ile Leu Leu
65 70 75 80

Pro Pro Ser Arg Arg Glu Phe Arg Asp Ile Tyr Val Val Phe Glu Leu
85 90 95

Met Glu Ser Asp Leu His Gln Val Ile Lys Ala Asn Asp Asp Leu Thr
100 105 110

Pro Glu His Tyr Gln Phe Phe Leu Tyr Gln Leu Leu Arg Gly Leu Lys
115 120 125

Tyr Ile His Thr Ala Asn Val Phe His Arg Asp Leu Lys Pro Lys Asn
130 135 140

Ile Leu Ala Asn Ala Asp Cys Lys Leu Lys Ile Cys Asp Phe Gly Leu
145 150 155 160

Ala Arg Val Ala Phe Ser Asp Thr Pro Thr Ala Ile Phe Trp Thr Asp
165 170 175

Tyr Val Ala Thr Arg Trp Tyr Arg Ala Pro Glu Leu Cys Gly Ser Phe
180 185 190

Phe Ser Lys Tyr Thr Pro Ala Ile Asp Ile Trp Ser Ile Gly Cys Ile
195 200 205

Phe Ala Glu Leu Leu Thr Gly Lys Pro Leu Phe Pro Gly Lys Asn Val
210 215 220

Val His Gln Leu Asp Ile Ile Thr Asp Leu Met Gly Thr Pro Ser Pro
225 230 235 240

Glu Ala Ile Ser Arg Ile Arg Asn Glu Lys Ala Arg Arg Tyr Leu Ser
245 250 255

Asn Met Arg Arg Lys Lys Pro Ile Pro Phe Thr Gln Lys Phe Pro Asn
260 265 270

Ala Asp Pro Leu Ala Leu Gly Leu Leu Glu Arg Met Leu Ala Phe Glu
275 280 285

Pro Lys Asp Arg Pro Ser Ala Glu Glu Ala Leu Ala His Pro Tyr Phe
290 295 300

Lys Asn Ile Ala Asn Val Asp Arg Glu Pro Ser Ala Gln Ala Val Thr
305 310 315 320

Lys Leu Glu Phe Glu Phe Glu Arg Arg Arg Val Thr Lys Asp Asp Ile
325 330 335

Arg Glu Leu Ile Tyr Arg Glu Ile Leu Glu Tyr His Pro Lys Met Leu
340 345 350

Lys Glu Phe Ile Glu Gly Thr Glu Ser Ser Gly Phe Met Tyr Pro Ser
355 360 365

Ala Val Asp Asn Phe Lys Lys Gln Phe Ala Tyr Leu Glu Glu His Tyr
370 375 380

Ala Lys Gly Ser Thr Gly Thr Pro Pro Glu Arg Gln His Asn Ser Leu
385 390 395 400

Pro Arg Pro Ser Val Val Tyr Ser Asp Asn Gln Ser Gln Ser Thr Ala
405 410 415

Asn Ile Thr Glu Asp Leu Ser Lys Cys Ile Ile Arg Glu Asn Ala Gln
420 425 430

Lys Ser Gln Gln Tyr Asn Ala Ser Val Ala Asn Lys Phe Pro Ser His
435 440 445

Val Pro Gln Gly Ala Ala Ala Arg Pro Gly Lys Ala Val Gly Ser Val
450 455 460

Leu Gln Tyr Asn Leu Ser Gln Ala Ser Ala Ala Glu Gln Tyr Glu Gln
465 470 475 480

Arg Arg Val Ala Arg His Pro Ala Val Ala Pro Asn Asn Ile Pro Ser
485 490 495

Gly Ser Ser Tyr Pro Arg Arg Asn Gln Thr Cys Lys Ser Glu Thr Gly
500 505 510

Asp Thr Glu Arg Met Asp Val Asn Gln Ala Gly Gln Pro Lys Ser Tyr
515 520 525

Ala Ala Asn Lys Leu Pro Ala Thr Val Asp Gly Arg Gly Gly His Trp
530 535 540


<210> 59 
<211> 25 
<212> DNA 
<213> Zea mays 



<400> 59 

atggattttt tcacagagta cggcg 25 

<210> 60 

<211> 20 

<212> DNA 

<213> Zea mays 

<400> 60 

tcaatgcgaa accaaattta 20 



<210> 61 
<211> 4014 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (1)...(4014)



<400> 61 

atg gcg tcg agg cag cac aat gcg cag ttc cac aaa tcc aag acg ctt 48

Met Ala Ser Arg Gln His Asn Ala Gln Phe His Lys Ser Lys Thr Leu

1 5 10 15

gac aac aaa tac atg ctt gga gat gaa ata ggg aag ggg gcg tac ggc 96 

Asp Asn Lys Tyr Met Leu Gly Asp Glu He Gly Lys Gly Ala Tyr Gly 
20 25 30 

cgc gta tac aag ggg ctt gac ctg gag aat ggc gat ttc gtg gcc ate 144 

Arg Val Tyr Lys Gly Leu Asp Leu Glu Asn Gly Asp Phe Val Ala He 
35 40 45 

aaa cag gtc teg ctg gag aac att ccg cag gag gat etc aac ata ata 192 

Lys Gin Val Ser Leu Glu Asn He Pro Gin Glu Asp Leu Asn He He 
50 55 60 



atg caa gag ate gac ctt ttg aaa aat ctt aat cat aaa aat att gtc 240 
Met Gin Glu He Asp Leu Leu Lys Asn Leu Asn His Lys Asn He Val 
65 70 75 80 

aag tat ttg gga tca ttg aag aca aag agc cac ctc cat att att ttg 288
Lys Tyr Leu Gly Ser Leu Lys Thr Lys Ser His Leu His Ile Ile Leu
85 90 95

gag tat gtg gag aat ggc tca ctt gct aat att atc aag cca aac aaa 336

Glu Tyr Val Glu Asn Gly Ser Leu Ala Asn Ile Ile Lys Pro Asn Lys

100 105 110

ttc gga cct ttt cct gaa tct ttg gtg get gta tac att get cag gtg 384 

Phe Gly Pro Phe Pro Glu Ser Leu Val Ala Val Tyr He Ala Gin Val 

115 120 125 

ttg gaa ggt ctt gtt tat ctg cat gaa caa ggt gtc att cat aga gat 432 

Leu Glu Gly Leu Val Tyr Leu His Glu Gin Gly Val He His Arg Asp 

130 135 140 

ate aag ggc gca aat ata ctg act acc aaa gag ggc ctt gtc aaa ctt 480 

He Lys Gly Ala Asn He Leu Thr Thr Lys Glu Gly Leu Val Lys Leu 

145 150 155 160 

get gac ttt gga gtt gcc act aaa ttg act gaa get gac ate aac act 528 

Ala Asp Phe Gly Val Ala Thr Lys Leu Thr Glu Ala Asp He Asn Thr 

165 170 175 

cat tct gtg gtc ggc act cca tac tgg atg gcg cct gag gtc ate gaa 576 

His Ser Val Val Gly Thr Pro Tyr Trp Met Ala Pro Glu Val He Glu 

180 185 190 

atg tct ggt gtt tgt get gcc tct gat ate tgg agt gtg ggt tgc acc 624 

Met Ser Gly Val Cys Ala Ala Ser Asp He Trp Ser Val Gly Cys Thr 

195 200 205 

gta att gaa ttg ctg aca tgt gtc ccg cca tat tat gat etc caa cct 672 

Val He Glu Leu Leu Thr Cys Val Pro Pro Tyr Tyr Asp Leu Gin Pro 

210 215 220 

atg cct gcc ctg ttc cgt att gtt cag gat gtg cat cca cca ata cca 720 

Met Pro Ala Leu Phe Arg He Val Gin Asp Val His Pro Pro He Pro 

225 230 235 240 

gaa gga ctg tca cct gag att act gat ttt ctc cgg caa tgt ttt caa 768

Glu Gly Leu Ser Pro Glu Ile Thr Asp Phe Leu Arg Gln Cys Phe Gln

245 250 255

aag gat gcg atg caa agg cct gat gcg aag aca tta ttg atg cat cca 816

Lys Asp Ala Met Gln Arg Pro Asp Ala Lys Thr Leu Leu Met His Pro

260 265 270

tgg ttg cag aac tca aga cgt gct ttg cct gct tcc ctt cgg caa cct 864

Trp Leu Gln Asn Ser Arg Arg Ala Leu Pro Ala Ser Leu Arg Gln Pro

275 280 285

act cca tta agg aat atc gac ggg gat gat gaa agc tca aga ggg gac 912

Thr Pro Leu Arg Asn Ile Asp Gly Asp Asp Glu Ser Ser Arg Gly Asp

290 295 300

aat aac teg ggt ttt tgc gac aca cca gga gat acc cga gca act att 960 

Asn Asn Ser Gly Phe Cys Asp Thr Pro Gly Asp Thr Arg Ala Thr He 

305 310 315 320 

get tct aat gtt gac cag gtg aat ggg agg aac gga cca aat atg gac 1008 

Ala Ser Asn Val Asp Gin Val Asn Gly Arg Asn Gly Pro Asn Met Asp 

325 330 335 

tct gct gcg caa agt aaa tct gaa gag ctt cat gat gga aat tta gaa 1056
Ser Ala Ala Gln Ser Lys Ser Glu Glu Leu His Asp Gly Asn Leu Glu
340 345 350

ctt aca gag ggc att agt teg aac aac gtg gcg ctt gtg aaa gat aat 1104 
Leu Thr Glu Gly He Ser Ser Asn Asn Val Ala Leu Val Lys Asp Asn 
355 360 365 

gtg gtc ctt aat aaa gat ccg aca eta gtt tta cat gag aag tta ccg 1152 
Val Val Leu Asn Lys Asp Pro Thr Leu Val Leu His Glu Lys Leu Pro 
370 375 380 

gtg gaa tct tea ttt gga gat get gat ttg aat ggc aaa gta atg gca 1200 
Val Glu Ser Ser Phe Gly Asp Ala Asp Leu Asn Gly Lys Val Met Ala 
385 390 395 400 

cat gaa eta ttg caa ggt ggt ctg cea agt aag ata gag ctg gag aat 1248 
His Glu Leu Leu Gin Gly Gly Leu Pro Ser Lys He Glu Leu Glu Asn 
405 410 415 

aaa gaa agt tea act gtt gag gat ggt gat gta ttc tea ttt cag get 1296 
Lys Glu Ser Ser Thr Val Glu Asp Gly Asp Val Phe Ser Phe Gin Ala 
420 425 430 

gga aga cag aat att gac ttc caa atg gtg gtt gaa ect tea gtt gtt 1344 
Gly Arg Gin Asn He Asp Phe Gin Met Val Val Glu Pro Ser Val Val 
435 440 445 

gag gga ccg aaa cag ctt agt aga ttt agt gac aaa cct gaa gat gcc 1392
Glu Gly Pro Lys Gln Leu Ser Arg Phe Ser Asp Lys Pro Glu Asp Ala
450 455 460

tct ttg gaa gac tta ttt eca eca ate gac aaa egg gga gat aat ggg 1440 
Ser Leu Glu Asp Leu Phe Pro Pro He Asp Lys Arg Gly Asp Asn Gly 
465 470 475 480 

get gaa ect tea aca tea act act gtc caa gaa ctt caa tat aat ggt 1488 
Ala Glu Pro Ser Thr Ser Thr Thr Val Gin Glu Leu Gin Tyr Asn Gly 
485 490 495 

gta cac aat gga ttc gtg aag ggg ctc aat gcc agg gtg gag aag caa 1536

Val His Asn Gly Phe Val Lys Gly Leu Asn Ala Arg Val Glu Lys Gln
500 505 510

aaa gaa aat gat age gag tec atg aat ggt gga aaa eta att gaa ttt 1584 
Lys Glu Asn Asp Ser Glu Ser Met Asn Gly Gly Lys Leu He Glu Phe 
515 520 525 

gcc atg cag eta gaa aat att gat gca teg ggt ttt ggt gaa cat att 1632 
Ala Met Gin Leu Glu Asn He Asp Ala Ser Gly Phe Gly Glu His He 
530 535 540 

ect gga gag agt ctt ttc cet ctg cag tct gtg gag tae age aaa ata. 1680 
Pro Gly Glu Ser Leu Phe Pro Leu Gin Ser Val Glu Tyr Ser Lys He 
545 550 555 560 

gta gca caa ctg aaa cea ggg gaa agt gaa gat gta ata ctg ttg gcg 1728 
Val Ala Gin Leu Lys Pro Gly Glu Ser Glu Asp Val He Leu Leu Ala 
565 570 575 

tgc caa aag ctt cta tca atc ttc agt cac cgg cct gag caa aaa cag 1776
Cys Gln Lys Leu Leu Ser Ile Phe Ser His Arg Pro Glu Gln Lys Gln
580 585 590

atc tat atg gca cag aat ggt ttc ctc ccg ctg atg gaa ctc ctt gaa 1824

Ile Tyr Met Ala Gln Asn Gly Phe Leu Pro Leu Met Glu Leu Leu Glu
595 600 605

att cct aaa aat cgt att tta tgc tea gtt ctg cag ctt ate aac aac 1872 

lie Pro Lys Asn Arg lie Leu Cys Ser Val Leu Gin Leu lie Asn Asn 

610 615 620 

att gta aaa gat agt aca ggc ttc ctg gaa aat get tgc ctt gtt ggc 1920 

lie Val Lys Asp Ser Thr Gly Phe Leu Glu Asn Ala Cys Leu Val Gly 

625 630 635 640 

ctt att cca gtg gtg atg aat ttt gca gaa cca aat cgc cca aag gat 1968 

Leu lie Pro Val Val Met Asn Phe Ala Glu Pro Asn Arg Pro Lys Asp 
645 650 655 

gtt egg gtg caa gca tec ttg ttc ctt cag cag ett tgt caa gcc age 2016 

Val Arg Val Gin Ala Ser Leu Phe Leu Gin Gin Leu Cys Gin Ala Ser 
660 665 670 

acc ttg aca ttg caa atg ttc att gca tgt caa ggg ate cct gtt ttg 2064 

Thr Leu Thr Leu Gin Met Phe lie Ala Cys Gin Gly lie Pro Val Leu 
675 680 685 

gta agc ttt atg gag cct gac tat gca aaa tac agg gat atg gtt cat 2112

Val Ser Phe Met Glu Pro Asp Tyr Ala Lys Tyr Arg Asp Met Val His

690 695 700

ctt gca att gac ggc att tgg cag gtc ttt aag etc cag cae tea act 2160 

Leu Ala lie Asp Gly lie Trp Gin Val Phe Lys Leu Gin His Ser Thr 

705 710 715 720 

cea aga aat gac ttc tgc cgt ata gca gca aaa aat ggg atg ctt cct 2208 

Pro Arg Asn Asp Phe Cys Arg lie Ala Ala Lys Asn Gly Met Leu Pro 
725 730 735 

agg ctg gtg aat aca eta tat age ttg aac gaa gca aca cga ttt gcc 2256 

Arg Leu Val Asn Thr Leu Tyr Ser Leu Asn Glu Ala Thr Arg Phe Ala 
740 745 750 

tea gtc tea ggg tea ggt get tea gta aca cag aat ggc tea acc ccc 2304 

Ser Val Ser Gly Ser Gly Ala Ser Val Thr Gin Asn Gly Ser Thr Pro 
755 760 765 

cga cga aga tet ggc caa tta gac cct tea gtt ett gaa agt tgc aag 2352 

Arg Arg Arg Ser Gly Gin Leu Asp Pro Ser Val Leu Glu Ser Cys Lys 

770 775 780 

gcg agg ctg gat cat cat cat tea tet ggt tet ttg caa tet eta caa 2400 

Ala Arg Leu Asp His His His Ser Ser Gly Ser Leu Gin Ser Leu Gin 

785 790 795 800 

gca gat gca gat aag cae eac ata tta atg gae tec teg tea tet cct 2448 

Ala Asp Ala Asp Lys His His He Leu Met Asp Ser Ser Ser Ser Pro 
805 810 815 

agg ttc agc gac aaa aca agc agt aac ttg gag agg aat gag aac gac 2496

Arg Phe Ser Asp Lys Thr Ser Ser Asn Leu Glu Arg Asn Glu Asn Asp
820 825 830

ttg gtt att cgg cca cag agg ctt agt gtt tct gcc gga agg aca tct 2544
Leu Val Ile Arg Pro Gln Arg Leu Ser Val Ser Ala Gly Arg Thr Ser
835 840 845

aca gac aga tea ccg aag cac ata gaa eta gtt tea aat ggt cat aat 2592 
Thr Asp Arg Ser Pro Lys His He Glu Leu Val Ser Asn Gly His Asn 
850 855 860 

aac agt caa aat gac cag gtc cga cct eta ctg agt tta ttg gag aaa 2640 
Asn Ser Gin Asn Asp Gin Val Arg Pro Leu Leu Ser Leu Leu Glu Lys 
865 870 875 880 

gaa cca cct tct cga cat gta tct gga caa ctt gat tat gtt cgc cat 2688 
Glu Pro Pro Ser Arg His Val Ser Gly Gin Leu Asp Tyr Val Arg His 
885 890 895 

ata tct ggt ttg gaa agg cat gaa acc att ctg cca eta tta cat aca 2736 
He Ser Gly Leu Glu Arg His Glu Thr He Leu Pro Leu Leu His Thr 
900 905 910 

teg aca gag agg aaa acg aat ggt gaa ctt gac tta atg atg gat ttt 2784 
Ser Thr Glu Arg Lys Thr Asn Gly Glu Leu Asp Leu Met Met Asp Phe 
915 920 925 

get gag gtt tct aga cat gga agg gag aat ggc cat ctg gac tct agt 2832 
Ala Glu Val Ser Arg His Gly Arg Glu Asn Gly His Leu Asp Ser Ser 
930 935 940 

ata aaa get tea aat aga gtt cat agt atg aaa tat gca cct tea gca 2880 
He Lys Ala Ser Asn Arg Val His Ser Met Lys Tyr Ala Pro Ser Ala 
945 950 955 960 

agt gcg tct aat gaa ggg gca tea act tct gga get gca tea caa act 2928 
Ser Ala Ser Asn Glu Gly Ala Ser Thr Ser Gly Ala Ala Ser Gin Thr 
965 970 975 

gca tct ggt gtg tta tea gga tea ggg gtg etc aat tta aga cct ggg 2976 
Ala Ser Gly Val Leu Ser Gly Ser Gly Val Leu Asn Leu Arg Pro Gly 
980 985 990 

agt aca aca tea tea ggc cca tta get cag atg ttc tct tec atg age 3024 
Ser Thr Thr Ser Ser Gly Pro Leu Ala Gin Met Phe Ser Ser Met Ser 
995 1000 1005 

gca gat atg gca cgt gaa tat ctt gag aaa gta gca gat ctt ctt ttg 3072
Ala Asp Val Ala Arg Glu Tyr Leu Glu Lys Val Ala Asp Leu Leu Leu
1010 1015 1020

gag ctt gca cag gca gac acc gtg gta aaa tct ctg atg gcc agc caa 3120
Glu Phe Ala Gln Ala Asp Thr Val Val Lys Ser Leu Met Ala Ser Gln
1025 1030 1035 1040

age ctt ctt aca agg gtt ttc caa atg ttt aac aag ata gag cct cct 3168 
Ser Leu Leu Thr Arg Val Phe Gin Met Phe Asn Lys He Glu Pro Pro 
1045 1050 1055 

att ctt ctt aag att ctt agg tgc att aat cat ttg teg ggt gat cct 3216 
He Leu Leu Lys He Leu Arg Cys He Asn His Leu Ser Gly Asp Pro 
1060 1065 1070 

aat tgt ctt gag aca ctt caa cgc aca gat gca atc aaa cat ctg ata 3264
Asn Cys Leu Glu Thr Leu Gln Arg Thr Asp Ala Ile Lys His Leu Ile
1075 1080 1085

ccg att ctg gag ctt cgt gat gga cct ctg gtt tat caa ata cat agt 3312
Pro Ile Leu Glu Leu Arg Asp Gly Pro Leu Val Tyr Gln Ile His Ser
1090 1095 1100

gag gtc ctt aat get ttg ttc aac ctt tgc aaa att aat aaa aga aga 3360 
Glu Val Leu Asn Ala Leu Phe Asn Leu Cys Lys He Asn Lys Arg Arg 
1105 1110 1115 1120 

cag gaa caa gca gcc gaa aat gga ate ate cct cac ttg atg aaa ttt 3408 
Gin Glu Gin Ala Ala Glu Asn Gly He lie Pro His Leu Met Lys Phe 
1125 1130 1135 

gtc atg tea gac tea cct ttg egg cag tat gcc ctg cct ttg ctg tgt 3456 
Val Met Ser Asp Ser Pro Leu Arg Gin Tyr Ala Leu Pro Leu Leu Cys 
1140 1145 1150 

gat atg get cat get tct cgc aac tct aga gag cag ttg agg gtt cat 3504 
Asp Met Ala His Ala Ser Arg Asn Ser Arg Glu Gin Leu Arg Val His 
1155 1160 1165 

992 gga eta gat gtg tac tta aat ctg ctg gag gat gat gca tgg gca 3552 
Gly Gly Leu Asp Val Tyr Leu Asn Leu Leu Glu Asp Asp Ala Trp Ala 
1170 1175 1180 

tgc acg gca tta gat tct att get gtt tgt ttg get cac gat aat gat 3600 
Cys Thr Ala Leu Asp Ser He Ala Val Cys Leu Ala His Asp Asn Asp 
1185 1190 1195 1200 

cac aga aaa gtg gaa caa gca ctg ttg aaa aaa gag gcc att cag aaa 3648 
His Arg Lys Val Glu Gin Ala Leu Leu Lys Lys Glu Ala He Gin Lys 
1205 1210 1215 

tta gtg aag ttc ttc caa gat tgc cca gaa cag tat ttt ggc cat ata 3696 
Leu Val Lys Phe Phe Gin Asp Cys Pro Glu Gin Tyr Phe Gly His He 
1220 1225 1230 

ttg gac get ttc ttg aag ata ate acg aaa tct tct agg eta aat aca 3744 
Leu Asp Ala Phe Leu Lys He He Thr Lys Ser Ser Arg Leu Asn Thr 
1235 1240 1245 

gca atg gct acc aat ggt tta aca aca ttg ctc att gca aga ctt gac 3792
Ala Met Ala Thr Asn Gly Leu Thr Thr Leu Leu Ile Ala Arg Leu Asp
1250 1255 1260

cat cga gaa gct att gcc cgc ttg act ctg ctg aaa ctc ata aag gtt 3840
His Arg Glu Ala Ile Ala Arg Leu Thr Leu Leu Lys Leu Ile Lys Val
1265 1270 1275 1280

gtc tat gag cac cat cct cga cca aaa cag etc ata gta gag aat gac 3888 
Val Tyr Glu His His Pro Arg Pro Lys Gin Leu He Val Glu Asn Asp 
1285 1290 1295 

etc cct cag aag eta cag aac etc att gaa gag cgc agg gat gga caa 3936 
Leu Pro Gin Lys Leu Gin Asn Leu He Glu Glu Arg Arg Asp Gly Gin 
1300 1305 1310 

egg ggc ggc caa caa gtg ttg gtc aaa caa atg gcc acc tea ttg ctg 3984 
Arg Gly Gly Gin Gin Val Leu Val Lys Gin Met Ala Thr Ser Leu Leu 
1315 1320 1325 

aaa gcg ctg cac ate aat aca gta ttg tga 4014 
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Ser 


Ala 


Ala 


Gin 


Ser 


Lys 








340 
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Thr Val Leu 
1335 



Asn 


Ala 


Gin 


Phe 


His 


Lys 








10 






Gly 


Asp 


Glu 


He 


Gly 


Lys 






25 








Asp 


Leu 


Glu 


Asn 


Gly 


Asp 




40 










Asn 


He 


Pro 


Gin 


Glu 


Asp 


55 










60 
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Lys 
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His 










75 




Lys 
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Lys 


Ser 


His 
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Ser 


Leu 


Ala 
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He 


He 
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Ser 


Leu 


Val 


Ala 


Val 


Tyr 
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His 


Glu 
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Gly 


Val 
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Thr 


Thr 


Lys 


Glu 


Gly 
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He 
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Asp 
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Ser 
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Gly 


Asp 
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315 




val 
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Gly 
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Asn 


Gly 
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Ser 


Glu 


Glu 
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His 


Asp 
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Ser 
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Val 


Ala 
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Thr 
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Val 
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His 
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Asp 


Leu 
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Gly 
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Gly 


Leu 


Pro 


Ser 


Lys 


He 
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Ser 


Lys 


Thr 


Leu 






15 




Gly Ala 


Tyr 


Gly 




30 






Phe 


Val 


Ala 


He 


45 








Leu 


Asn 


He 


He 
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He 


Val 








80 


His 


He 


He 


Leu 






95 




Lys 


Pro 


Asn 


Lys 




110 






He 


Ala 


Gin 


Val 


125 








He 


His 


Arg 


Asp 


Leu 


Val 


Lys 


Leu 








160 


Asp 


He 


Asn 


Thr 






175 




Glu 


Val 


He 


Glu 




190 






Val 


Gly 


Cys 


Thr 


205 








Asp 


Leu 


Gin 


Pro 


Pro 


Pro 


He 


Pro 








240 


Gin 


Cys 


Phe 


Gin 






255 




Leu 


Met 
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Pro 
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Leu 


Arg 


Gin 


Pro 


285 








Ser 


Arg Gly 


Asp 


Arg 


Ala 


Thr 


He 








320 


Pro 


Asn 


Met 


Asp 






335 




Gly Asn 


Leu 


Glu 
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Val 
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Asp 


Asn 
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Glu 
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Glu 
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Glu 
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415 
Lys Glu Ser Ser Thr Val Glu Asp Gly Asp Val Phe Ser Phe Gln Ala
420 425 430

Gly Arg Gln Asn Ile Asp Phe Gln Met Val Val Glu Pro Ser Val Val
435 440 445

Glu Gly Pro Lys Gln Leu Ser Arg Phe Ser Asp Lys Pro Glu Asp Ala
450 455 460

Ser Leu Glu Asp Leu Phe Pro Pro Ile Asp Lys Arg Gly Asp Asn Gly
465 470 475 480

Ala Glu Pro Ser Thr Ser Thr Thr Val Gln Glu Leu Gln Tyr Asn Gly
485 490 495

Val His Asn Gly Phe Val Lys Gly Leu Asn Ala Arg Val Glu Lys Gln
500 505 510

Lys Glu Asn Asp Ser Glu Ser Met Asn Gly Gly Lys Leu Ile Glu Phe
515 520 525

Ala Met Gln Leu Glu Asn Ile Asp Ala Ser Gly Phe Gly Glu His Ile
530 535 540

Pro Gly Glu Ser Leu Phe Pro Leu Gln Ser Val Glu Tyr Ser Lys Ile
545 550 555 560

Val Ala Gln Leu Lys Pro Gly Glu Ser Glu Asp Val Ile Leu Leu Ala
565 570 575

Cys Gln Lys Leu Leu Ser Ile Phe Ser His Arg Pro Glu Gln Lys Gln
580 585 590

Ile Tyr Met Ala Gln Asn Gly Phe Leu Pro Leu Met Glu Leu Leu Glu
595 600 605

Ile Pro Lys Asn Arg Ile Leu Cys Ser Val Leu Gln Leu Ile Asn Asn
610 615 620

Ile Val Lys Asp Ser Thr Gly Phe Leu Glu Asn Ala Cys Leu Val Gly
625 630 635 640

Leu Ile Pro Val Val Met Asn Phe Ala Glu Pro Asn Arg Pro Lys Asp
645 650 655

Val Arg Val Gln Ala Ser Leu Phe Leu Gln Gln Leu Cys Gln Ala Ser
660 665 670

Thr Leu Thr Leu Gln Met Phe Ile Ala Cys Gln Gly Ile Pro Val Leu
675 680 685

Val Ser Phe Met Glu Pro Asp Tyr Ala Lys Tyr Arg Asp Met Val His
690 695 700

Leu Ala Ile Asp Gly Ile Trp Gln Val Phe Lys Leu Gln His Ser Thr
705 710 715 720

Pro Arg Asn Asp Phe Cys Arg Ile Ala Ala Lys Asn Gly Met Leu Pro
725 730 735

Arg Leu Val Asn Thr Leu Tyr Ser Leu Asn Glu Ala Thr Arg Phe Ala
740 745 750

Ser Val Ser Gly Ser Gly Ala Ser Val Thr Gln Asn Gly Ser Thr Pro
755 760 765

Arg Arg Arg Ser Gly Gln Leu Asp Pro Ser Val Leu Glu Ser Cys Lys
770 775 780

Ala Arg Leu Asp His His His Ser Ser Gly Ser Leu Gln Ser Leu Gln
785 790 795 800

Ala Asp Ala Asp Lys His His Ile Leu Met Asp Ser Ser Ser Ser Pro
805 810 815

Arg Phe Ser Asp Lys Thr Ser Ser Asn Leu Glu Arg Asn Glu Asn Asp
820 825 830

Leu Val Ile Arg Pro Gln Arg Leu Ser Val Ser Ala Gly Arg Thr Ser
835 840 845

Thr Asp Arg Ser Pro Lys His Ile Glu Leu Val Ser Asn Gly His Asn
850 855 860

Asn Ser Gln Asn Asp Gln Val Arg Pro Leu Leu Ser Leu Leu Glu Lys
865 870 875 880

Glu Pro Pro Ser Arg His Val Ser Gly Gln Leu Asp Tyr Val Arg His
885 890 895

Ile Ser Gly Leu Glu Arg His Glu Thr Ile Leu Pro Leu Leu His Thr
900 905 910
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Val 


His 






1155 








1160 








1165 






Gly Gly Leu Asp Val Tyr Leu Asn Leu Leu Glu Asp Asp Ala Trp Ala
1170 1175 1180

Cys Thr Ala Leu Asp Ser Ile Ala Val Cys Leu Ala His Asp Asn Asp
1185 1190 1195 1200

His Arg Lys Val Glu Gln Ala Leu Leu Lys Lys Glu Ala Ile Gln Lys
1205 1210 1215

Leu Val Lys Phe Phe Gln Asp Cys Pro Glu Gln Tyr Phe Gly His Ile
1220 1225 1230

Leu Asp Ala Phe Leu Lys Ile Ile Thr Lys Ser Ser Arg Leu Asn Thr
1235 1240 1245

Ala Met Ala Thr Asn Gly Leu Thr Thr Leu Leu Ile Ala Arg Leu Asp
1250 1255 1260

His Arg Glu Ala Ile Ala Arg Leu Thr Leu Leu Lys Leu Ile Lys Val
1265 1270 1275 1280

Val Tyr Glu His His Pro Arg Pro Lys Gln Leu Ile Val Glu Asn Asp
1285 1290 1295

Leu Pro Gln Lys Leu Gln Asn Leu Ile Glu Glu Arg Arg Asp Gly Gln
1300 1305 1310

Arg Gly Gly Gln Gln Val Leu Val Lys Gln Met Ala Thr Ser Leu Leu
1315 1320 1325

Lys Ala Leu His Ile Asn Thr Val Leu
1330 1335



<210> 63
<211> 20
<212> DNA
<213> Zea mays

<400> 63
atggcgtcga ggcagcacaa 20

<210> 64
<211> 20
<212> DNA
<213> Zea mays

<400> 64
tcacaatact gtattgatgt 20

<210> 65 
<211> 1196 
<212> DNA 
<213> Zea mays 

<220>
<221> CDS
<222> (117)...(924)

<400> 65
ccaaaatcga ccttgtcgac ccggatccga gctaggcacc cgcacgacca cccgcctcct 60
gccggtcacc gggacctagc atcgccgcgc ggcggcgggg gaggggaagg taggcg atg 119
Met
1

agg aaa cgt gag cgg gag aac ccg tgc ggg atc tgc ggg cac tac cac 167
Arg Lys Arg Glu Arg Glu Asn Pro Cys Gly Ile Cys Gly His Tyr His
5 10 15

aag tac gag gag ggg gag gta tgc ggg gtg tgc ggc cac cgc tgg aaa 215
Lys Tyr Glu Glu Gly Glu Val Cys Gly Val Cys Gly His Arg Trp Lys
20 25 30

ccg tcg gac ggg gag ggc gcc ccg tcg agg cac gag tcc gcc ttc ccc 263
Pro Ser Asp Gly Glu Gly Ala Pro Ser Arg His Glu Ser Ala Phe Pro
35 40 45

aca gag gtg ctc aag gac ttt ctc ttc ctc ggg agc tac gac aac gcc 311
Thr Glu Val Leu Lys Asp Phe Leu Phe Leu Gly Ser Tyr Asp Asn Ala
50 55 60 65

tca cgc tcc gag gtc ctc aga acg ctc agt atc tcc cac atc ctt aat 359
Ser Arg Ser Glu Val Leu Arg Thr Leu Ser Ile Ser His Ile Leu Asn
70 75 80

act gtg cct gat tgc cac aat ctg tac aag aat tca ttc aca tat cac 407
Thr Val Pro Asp Cys His Asn Leu Tyr Lys Asn Ser Phe Thr Tyr His
85 90 95

agt ctt caa cgg gac agg cct ttg gat ttt gat gag gca aac caa tat 455
Ser Leu Gln Arg Asp Arg Pro Leu Asp Phe Asp Glu Ala Asn Gln Tyr
100 105 110

cta gag aaa tgt gaa aga gat aaa tcg cgt gtt ctt gtt cat tgc atg 503
Leu Glu Lys Cys Glu Arg Asp Lys Ser Arg Val Leu Val His Cys Met
115 120 125

act gga aaa aac agg tct gct gct att gtt gca gcc ttc ttg atg aag 551
Thr Gly Lys Asn Arg Ser Ala Ala Ile Val Ala Ala Phe Leu Met Lys
130 135 140 145

tct aga ggg tgg aga ctt gca caa tct ttt caa tgg gtg aaa gat cga 599
Ser Arg Gly Trp Arg Leu Ala Gln Ser Phe Gln Trp Val Lys Asp Arg
150 155 160

aga cca cag gtt caa ctg aca gat gct tct cag aag gag ctt ttg gag 647
Arg Pro Gln Val Gln Leu Thr Asp Ala Ser Gln Lys Glu Leu Leu Glu
165 170 175

tat gag ctg aag ctt ttc gga ccc agt act caa acc ttg gtt ccc act 695
Tyr Glu Leu Lys Leu Phe Gly Pro Ser Thr Gln Thr Leu Val Pro Thr
180 185 190

gaa tca ttt gct tct ctt gga ttt ggt tat aca aaa cca gca ggt gac 743
Glu Ser Phe Ala Ser Leu Gly Phe Gly Tyr Thr Lys Pro Ala Gly Asp
195 200 205

atg caa acg cct gtg ttc aac caa acg acc atg cca tcc ata ttc gag 791
Met Gln Thr Pro Val Phe Asn Gln Thr Thr Met Pro Ser Ile Phe Glu
210 215 220 225

cga ttc aac cca aat gat cgc ccc acc aat ttc act ttt gga gtt gag 839
Arg Phe Asn Pro Asn Asp Arg Pro Thr Asn Phe Thr Phe Gly Val Glu
230 235 240

agg aaa gca gaa gct aac cct cga gat aac aat agt gat ggc agg gta 887
Arg Lys Ala Glu Ala Asn Pro Arg Asp Asn Asn Ser Asp Gly Arg Val
245 250 255

aac cca act tca acg gag aac ctg atg gac agc tct t aagatgattt 934
Asn Pro Thr Ser Thr Glu Asn Leu Met Asp Ser Ser
260 265

cacgatgtcg ctgttgaatc tccttgttcc agaagtaaac ccgctgtaat ctgcatgtcg 994
cggaagtaaa cctactgctc ggtgcctcgg tgcaccgaat acttatccgg tgtctaggtt 1054
tcagggaggt tttgcccttg tcgcatgaga aatggttctc tggaaaccga tctttgcttt 1114
agaatgactg tatgaaatag atgtttgtgg ctaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1174
aaaaaaaaaa aaaaaaaaaa aa 1196

<210> 66 

<211> 269 

<212> PRT 

<213> Zea mays 

<400> 66 

Met Arg Lys Arg Glu Arg Glu Asn Pro Cys Gly Ile Cys Gly His Tyr
1 5 10 15

His Lys Tyr Glu Glu Gly Glu Val Cys Gly Val Cys Gly His Arg Trp
20 25 30

Lys Pro Ser Asp Gly Glu Gly Ala Pro Ser Arg His Glu Ser Ala Phe
35 40 45

Pro Thr Glu Val Leu Lys Asp Phe Leu Phe Leu Gly Ser Tyr Asp Asn
50 55 60

Ala Ser Arg Ser Glu Val Leu Arg Thr Leu Ser Ile Ser His Ile Leu
65 70 75 80

Asn Thr Val Pro Asp Cys His Asn Leu Tyr Lys Asn Ser Phe Thr Tyr
85 90 95

His Ser Leu Gln Arg Asp Arg Pro Leu Asp Phe Asp Glu Ala Asn Gln
100 105 110

Tyr Leu Glu Lys Cys Glu Arg Asp Lys Ser Arg Val Leu Val His Cys
115 120 125

Met Thr Gly Lys Asn Arg Ser Ala Ala Ile Val Ala Ala Phe Leu Met
130 135 140

Lys Ser Arg Gly Trp Arg Leu Ala Gln Ser Phe Gln Trp Val Lys Asp
145 150 155 160

Arg Arg Pro Gln Val Gln Leu Thr Asp Ala Ser Gln Lys Glu Leu Leu
165 170 175

Glu Tyr Glu Leu Lys Leu Phe Gly Pro Ser Thr Gln Thr Leu Val Pro
180 185 190

Thr Glu Ser Phe Ala Ser Leu Gly Phe Gly Tyr Thr Lys Pro Ala Gly
195 200 205

Asp Met Gln Thr Pro Val Phe Asn Gln Thr Thr Met Pro Ser Ile Phe
210 215 220

Glu Arg Phe Asn Pro Asn Asp Arg Pro Thr Asn Phe Thr Phe Gly Val
225 230 235 240

Glu Arg Lys Ala Glu Ala Asn Pro Arg Asp Asn Asn Ser Asp Gly Arg
245 250 255

Val Asn Pro Thr Ser Thr Glu Asn Leu Met Asp Ser Ser
260 265

<210> 67 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 67 

atgaggaaac gtgagcggga gaacc 25 

<210> 68 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 68 

ttaagagctg tccatcaggt tctcc 25 

<210> 69 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Designed oligonucleotide based upon the adapter
sequence and poly T to remove clones which have a
poly A tail but no cDNA.

<400> 69 

tcgacccacg cgtccgaaaa aaaaaaaaaa aaaaaa 36 
Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet)



This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons:

1. Claims Nos.:
because they relate to subject matter not required to be searched by this Authority, namely:

2. Claims Nos.:
because they relate to parts of the International Application that do not comply with the prescribed requirements to such an extent that no meaningful International Search can be carried out, specifically:

3. Claims Nos.:
because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a).

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet)

This International Searching Authority found multiple inventions in this international application, as follows:

see additional sheet

1. [ ] As all required additional search fees were timely paid by the applicant, this International Search Report covers all searchable claims.

2. [ ] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment of any additional fee.

3. [X] As only some of the required additional search fees were timely paid by the applicant, this International Search Report covers only those claims for which fees were paid, specifically claims Nos.:

1-33 all partially, subject groups 1 and 3.

4. [ ] No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is restricted to the invention first mentioned in the claims; it is covered by claims Nos.:

Remark on Protest [ ] The additional search fees were accompanied by the applicant's protest.

[X] No protest accompanied the payment of additional search fees.
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-13 all partially

Maize nucleic acid and corresponding protein as
characterised by SEQ ID NOS:1 and 2, polynucleotides and
proteins related to said sequences as defined by the claims,
recombinant expression cassettes, host cells, transgenic
plants, methods for modulation of signal transduction gene
expression, based on said SEQ ID NOS:1 and 2 and said
related polynucleotides and proteins.

2. Claims: 1-13 all partially

Maize nucleic acid and corresponding protein as
characterised by SEQ ID NOS:5 and 6, polynucleotides and
proteins related to said sequences as defined by the claims,
recombinant expression cassettes, host cells, transgenic
plants, methods for modulation of signal transduction gene
expression, based on said SEQ ID NOS:5 and 6 and said
related polynucleotides and proteins.

3. Claims: 1-13 all partially

Maize nucleic acid and corresponding protein as
characterised by SEQ ID NOS:9 and 10, polynucleotides and
proteins related to said sequences as defined by the claims,
recombinant expression cassettes, host cells, transgenic
plants, methods for modulation of signal transduction gene
expression, based on said SEQ ID NOS:9 and 10 and said
related polynucleotides and proteins.

4. Claims: 1-13 all partially

Maize nucleic acid and corresponding protein as
characterised by SEQ ID NOS:13 and 14, polynucleotides and
proteins related to said sequences as defined by the claims,
recombinant expression cassettes, host cells, transgenic
plants, methods for modulation of signal transduction gene
expression, based on said SEQ ID NOS:13 and 14 and said
related polynucleotides and proteins.

5. Claims: 1-13 all partially

Maize nucleic acid and corresponding protein as
characterised by SEQ ID NOS:17 and 18, polynucleotides and
proteins related to said sequences as defined by the claims,
recombinant expression cassettes, host cells, transgenic
plants, methods for modulation of signal transduction gene
expression, based on said SEQ ID NOS:17 and 18 and said
related polynucleotides and proteins.

INTERNATIONAL SEARCH REPORT

International Application No. PCT/US 00/11687

FURTHER INFORMATION CONTINUED FROM PCT/ISA/210



6. Claims: 1-13 all partially

Maize nucleic acid and corresponding protein as
characterised by SEQ ID NOS:21 and 22, polynucleotides and
proteins related to said sequences as defined by the claims,
recombinant expression cassettes, host cells, transgenic
plants, methods for modulation of signal transduction gene
expression, based on said SEQ ID NOS:21 and 22 and said
related polynucleotides and proteins.



7. Claims: 1-13 all partially

Maize nucleic acid and corresponding protein as
characterised by SEQ ID NOS:25 and 26, polynucleotides and
proteins related to said sequences as defined by the claims,
recombinant expression cassettes, host cells, transgenic
plants, methods for modulation of signal transduction gene
expression, based on said SEQ ID NOS:25 and 26 and said
related polynucleotides and proteins.



8. Claims: 1-13 all partially

Maize nucleic acid and corresponding protein as
characterised by SEQ ID NOS:29 and 30, polynucleotides and
proteins related to said sequences as defined by the claims,
recombinant expression cassettes, host cells, transgenic
plants, methods for modulation of signal transduction gene
expression, based on said SEQ ID NOS:29 and 30 and said
related polynucleotides and proteins.



9. Claims: 1-13 all partially

Maize nucleic acid and corresponding protein as
characterised by SEQ ID NOS:33 and 34, polynucleotides and
proteins related to said sequences as defined by the claims,
recombinant expression cassettes, host cells, transgenic
plants, methods for modulation of signal transduction gene
expression, based on said SEQ ID NOS:33 and 34 and said
related polynucleotides and proteins.



10. Claims: 1-13 all partially

Maize nucleic acid and corresponding protein as
characterised by SEQ ID NOS:37 and 38, polynucleotides and
proteins related to said sequences as defined by the claims,
recombinant expression cassettes, host cells, transgenic
plants, methods for modulation of signal transduction gene
expression, based on said SEQ ID NOS:37 and 38 and said
related polynucleotides and proteins.



11. Claims: 1-13 all partially

Maize nucleic acid and corresponding protein as
characterised by SEQ ID NOS:41 and 42, polynucleotides and
proteins related to said sequences as defined by the claims,
recombinant expression cassettes, host cells, transgenic
plants, methods for modulation of signal transduction gene
expression, based on said SEQ ID NOS:41 and 42 and said
related polynucleotides and proteins.



12. Claims: 1-13 all partially

Maize nucleic acid and corresponding protein as
characterised by SEQ ID NOS:45 and 46, polynucleotides and
proteins related to said sequences as defined by the claims,
recombinant expression cassettes, host cells, transgenic
plants, methods for modulation of signal transduction gene
expression, based on said SEQ ID NOS:45 and 46 and said
related polynucleotides and proteins.



13. Claims: 1-13 all partially

Maize nucleic acid and corresponding protein as
characterised by SEQ ID NOS:49 and 50, polynucleotides and
proteins related to said sequences as defined by the claims,
recombinant expression cassettes, host cells, transgenic
plants, methods for modulation of signal transduction gene
expression, based on said SEQ ID NOS:49 and 50 and said
related polynucleotides and proteins.



14. Claims: 1-13 all partially

Maize nucleic acid and corresponding protein as
characterised by SEQ ID NOS:53 and 54, polynucleotides and
proteins related to said sequences as defined by the claims,
recombinant expression cassettes, host cells, transgenic
plants, methods for modulation of signal transduction gene
expression, based on said SEQ ID NOS:53 and 54 and said
related polynucleotides and proteins.



15. Claims: 1-13 all partially

Maize nucleic acid and corresponding protein as
characterised by SEQ ID NOS:57 and 58, polynucleotides and
proteins related to said sequences as defined by the claims,
recombinant expression cassettes, host cells, transgenic
plants, methods for modulation of signal transduction gene
expression, based on said SEQ ID NOS:57 and 58 and said
related polynucleotides and proteins.



16. Claims: 1-13 all partially

Maize nucleic acid and corresponding protein as
characterised by SEQ ID NOS:61 and 62, polynucleotides and
proteins related to said sequences as defined by the claims,
recombinant expression cassettes, host cells, transgenic
plants, methods for modulation of signal transduction gene
expression, based on said SEQ ID NOS:61 and 62 and said
related polynucleotides and proteins.



17. Claims: 1-13 all partially

Maize nucleic acid and corresponding protein as
characterised by SEQ ID NOS:65 and 66, polynucleotides and
proteins related to said sequences as defined by the claims,
recombinant expression cassettes, host cells, transgenic
plants, methods for modulation of signal transduction gene
expression, based on said SEQ ID NOS:65 and 66 and said
related polynucleotides and proteins.
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